Class BaseRewriteManifests
- All Implemented Interfaces:
- PendingUpdate<Snapshot>,- RewriteManifests,- SnapshotUpdate<RewriteManifests>
- 
Method SummaryModifier and TypeMethodDescriptionaddManifest(ManifestFile manifest) Adds amanifest fileto the table.apply()Apply the pending changes and return the uncommitted changes for validation.apply(TableMetadata base, Snapshot snapshot) Apply the update's changes to the given metadata and snapshot.protected booleanprotected voidcleanAll()protected voidcleanUncommitted(Set<ManifestFile> committed) Clean up any uncommitted manifests that were created.protected booleanGroups an existingDataFileby a cluster key produced by a function.voidcommit()Apply the pending changes and commit.protected CommitMetricsprotected TableMetadatacurrent()protected voiddeleteFile(String path) deleteManifest(ManifestFile manifest) Deletes amanifest filefrom the table.deleteWith(Consumer<String> deleteCallback) Set a callback to delete files instead of the table's default.protected OutputFileprotected ManifestReader<DeleteFile> newDeleteManifestReader(ManifestFile manifest) protected ManifestWriter<DeleteFile> protected EncryptedOutputFileprotected ManifestReader<DataFile> newManifestReader(ManifestFile manifest) protected ManifestWriter<DataFile> protected RollingManifestWriter<DeleteFile> protected RollingManifestWriter<DataFile> protected StringA string that describes the action that produced the new snapshot.protected TableOperationsops()protected TableMetadatarefresh()protected RewriteManifestsreportWith(MetricsReporter newReporter) rewriteIf(Predicate<ManifestFile> pred) Determines which existingManifestFilefor the table should be rewritten.scanManifestsWith(ExecutorService executorService) Use a particular executor to scan manifests.protected RewriteManifestsself()Set a summary property in the snapshot produced by this update.protected longCalled to stage a snapshot in table metadata, but not update the current snapshot id.summary()protected Stringprotected voidtargetBranch(String branch) A setter for the target branch on which snapshot producer operation should be performedGenerates update event to notify about metadata changesprotected voidvalidate(TableMetadata currentMetadata, Snapshot snapshot) Validate the current metadata.protected ExecutorServiceprotected List<ManifestFile> writeDataManifests(Collection<DataFile> files, Long dataSeq, PartitionSpec spec) protected List<ManifestFile> writeDataManifests(Collection<DataFile> files, PartitionSpec spec) protected List<ManifestFile> writeDeleteManifests(Collection<DeleteFile> files, PartitionSpec spec) Methods inherited from class java.lang.Objectclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface org.apache.iceberg.PendingUpdateapply, commitMethods inherited from interface org.apache.iceberg.SnapshotUpdatedeleteWith, scanManifestsWith, stageOnly, toBranch
- 
Method Details- 
self
- 
operationA string that describes the action that produced the new snapshot.- Returns:
- a string operation
 
- 
setDescription copied from interface:SnapshotUpdateSet a summary property in the snapshot produced by this update.- Specified by:
- setin interface- SnapshotUpdate<RewriteManifests>
- Parameters:
- property- a String property name
- value- a String property value
- Returns:
- this for method chaining
 
- 
summary
- 
clusterByDescription copied from interface:RewriteManifestsGroups an existingDataFileby a cluster key produced by a function. The cluster key will determine which data file will be associated with a particular manifest. All data files with the same cluster key will be written to the same manifest (unless the file is large and split into multiple files). Manifests deleted viaRewriteManifests.deleteManifest(ManifestFile)or added viaRewriteManifests.addManifest(ManifestFile)are ignored during the rewrite process.- Specified by:
- clusterByin interface- RewriteManifests
- Parameters:
- func- Function used to cluster data files to manifests.
- Returns:
- this for method chaining
 
- 
rewriteIfDescription copied from interface:RewriteManifestsDetermines which existingManifestFilefor the table should be rewritten. Manifests that do not match the predicate are kept as-is. If this is not called and no predicate is set, then all manifests will be rewritten.- Specified by:
- rewriteIfin interface- RewriteManifests
- Parameters:
- pred- Predicate used to determine which manifests to rewrite. If true then the manifest file will be included for rewrite. If false then the manifest is kept as-is.
- Returns:
- this for method chaining
 
- 
deleteManifestDescription copied from interface:RewriteManifestsDeletes amanifest filefrom the table.- Specified by:
- deleteManifestin interface- RewriteManifests
- Parameters:
- manifest- a manifest to delete
- Returns:
- this for method chaining
 
- 
addManifestDescription copied from interface:RewriteManifestsAdds amanifest fileto the table. The added manifest cannot contain new or deleted files.By default, the manifest will be rewritten to ensure all entries have explicit snapshot IDs. In that case, it is always the responsibility of the caller to manage the lifecycle of the original manifest. If manifest entries are allowed to inherit the snapshot ID assigned on commit, the manifest should never be deleted manually if the commit succeeds as it will become part of the table metadata and will be cleaned up on expiry. If the manifest gets merged with others while preparing a new snapshot, it will be deleted automatically if this operation is successful. If the commit fails, the manifest will never be deleted and it is up to the caller whether to delete or reuse it. - Specified by:
- addManifestin interface- RewriteManifests
- Parameters:
- manifest- a manifest to add
- Returns:
- this for method chaining
 
- 
applyApply the update's changes to the given metadata and snapshot. Return the new manifest list.- Parameters:
- base- the base table metadata to apply changes to
- snapshot- snapshot to apply the changes to
- Returns:
- a manifest list for the new snapshot.
 
- 
updateEventDescription copied from interface:PendingUpdateGenerates update event to notify about metadata changes- Specified by:
- updateEventin interface- PendingUpdate<Snapshot>
- Returns:
- the generated event
 
- 
cleanUncommittedClean up any uncommitted manifests that were created.Manifests may not be committed if apply is called more because a commit conflict has occurred. Implementations may keep around manifests because the same changes will be made by both apply calls. This method instructs the implementation to clean up those manifests and passes the paths of the manifests that were actually committed. - Parameters:
- committed- a set of manifest paths that were actually committed
 
- 
stageOnlyDescription copied from interface:SnapshotUpdateCalled to stage a snapshot in table metadata, but not update the current snapshot id.- Specified by:
- stageOnlyin interface- SnapshotUpdate<ThisT>
- Returns:
- this for method chaining
 
- 
scanManifestsWithDescription copied from interface:SnapshotUpdateUse a particular executor to scan manifests. The default worker pool will be used by default.- Specified by:
- scanManifestsWithin interface- SnapshotUpdate<ThisT>
- Parameters:
- executorService- the provided executor
- Returns:
- this for method chaining
 
- 
ops
- 
commitMetrics
- 
reportWith
- 
targetBranchA setter for the target branch on which snapshot producer operation should be performed- Parameters:
- branch- to set as target branch
 
- 
targetBranch
- 
workerPool
- 
deleteWithDescription copied from interface:SnapshotUpdateSet a callback to delete files instead of the table's default.- Specified by:
- deleteWithin interface- SnapshotUpdate<ThisT>
- Parameters:
- deleteCallback- a String consumer used to delete locations.
- Returns:
- this for method chaining
 
- 
validateValidate the current metadata.Child operations can override this to add custom validation. - Parameters:
- currentMetadata- current table metadata to validate
- snapshot- ending snapshot on the lineage which is being validated
 
- 
applyDescription copied from interface:PendingUpdateApply the pending changes and return the uncommitted changes for validation.This does not result in a permanent update. - Specified by:
- applyin interface- PendingUpdate<ThisT>
- Returns:
- the uncommitted changes that would be committed by calling PendingUpdate.commit()
 
- 
current
- 
refresh
- 
commitpublic void commit()Description copied from interface:PendingUpdateApply the pending changes and commit.Changes are committed by calling the underlying table's commit method. Once the commit is successful, the updated table will be refreshed. - Specified by:
- commitin interface- PendingUpdate<ThisT>
 
- 
cleanAllprotected void cleanAll()
- 
deleteFile
- 
manifestListPath
- 
newManifestOutputFile
- 
newManifestWriter
- 
newDeleteManifestWriter
- 
newRollingManifestWriter
- 
newRollingDeleteManifestWriter
- 
newManifestReader
- 
newDeleteManifestReader
- 
snapshotIdprotected long snapshotId()
- 
canInheritSnapshotIdprotected boolean canInheritSnapshotId()
- 
cleanupAfterCommitprotected boolean cleanupAfterCommit()
- 
writeDataManifests
- 
writeDataManifestsprotected List<ManifestFile> writeDataManifests(Collection<DataFile> files, Long dataSeq, PartitionSpec spec) 
- 
writeDeleteManifests
 
-