Class RewriteDataFiles.Builder
java.lang.Object
org.apache.iceberg.flink.maintenance.api.MaintenanceTaskBuilder<RewriteDataFiles.Builder>
org.apache.iceberg.flink.maintenance.api.RewriteDataFiles.Builder
- Enclosing class:
- RewriteDataFiles
public static class RewriteDataFiles.Builder
extends MaintenanceTaskBuilder<RewriteDataFiles.Builder>
- 
Constructor SummaryConstructors
- 
Method SummaryModifier and TypeMethodDescriptionconfig(RewriteDataFilesConfig rewriteDataFilesConfig) Configures the properties for the rewriter.deleteFileThreshold(int deleteFileThreshold) Configures the minimum delete file number for a file after a rewrite is always initiated.filter(Expression newFilter) A user provided filter for determining which files will be considered by the rewrite strategy.maxFileGroupSizeBytes(long maxFileGroupSizeBytes) Configures the group size for rewriting.maxFileSizeBytes(long maxFileSizeBytes) Configures the max file size considered for rewriting.maxFilesToRewrite(int maxFilesToRewrite) Configures max files to rewrite.maxRewriteBytes(long newMaxRewriteBytes) Configures the maximum byte size of the rewrites for one scheduled compaction.minFileSizeBytes(long minFileSizeBytes) Configures the min file size considered for rewriting.minInputFiles(int minInputFiles) Configures the minimum file number after a rewrite is always initiated.partialProgressEnabled(boolean newPartialProgressEnabled) Allows committing compacted data files in batches.partialProgressMaxCommits(int newPartialProgressMaxCommits) Configures the size of batches ifpartialProgressEnabled.rewriteAll(boolean rewriteAll) Overrides other options and forces rewriting of all provided files.targetFileSizeBytes(long targetFileSizeBytes) Configures the target file size.Methods inherited from class org.apache.iceberg.flink.maintenance.api.MaintenanceTaskBuilderindex, operatorName, parallelism, parallelism, scheduleOnCommitCount, scheduleOnDataFileCount, scheduleOnDataFileSize, scheduleOnEqDeleteFileCount, scheduleOnEqDeleteRecordCount, scheduleOnInterval, scheduleOnPosDeleteFileCount, scheduleOnPosDeleteRecordCount, slotSharingGroup, slotSharingGroup, tableLoader, tableName, taskName, uidSuffix, uidSuffix
- 
Constructor Details- 
Builderpublic Builder()
 
- 
- 
Method Details- 
partialProgressEnabledAllows committing compacted data files in batches. SeeRewriteDataFiles.PARTIAL_PROGRESS_ENABLEDfor more details.- Parameters:
- newPartialProgressEnabled- to enable partial commits
 
- 
partialProgressMaxCommitsConfigures the size of batches ifpartialProgressEnabled. SeeRewriteDataFiles.PARTIAL_PROGRESS_MAX_COMMITSfor more details.- Parameters:
- newPartialProgressMaxCommits- to target number of the commits per run
 
- 
maxRewriteBytesConfigures the maximum byte size of the rewrites for one scheduled compaction. This could be used to limit the resources used by the compaction.- Parameters:
- newMaxRewriteBytes- to limit the size of the rewrites
 
- 
targetFileSizeBytesConfigures the target file size. SeeRewriteDataFiles.TARGET_FILE_SIZE_BYTESfor more details.- Parameters:
- targetFileSizeBytes- target file size
 
- 
minFileSizeBytesConfigures the min file size considered for rewriting. SeeSizeBasedFileRewritePlanner.MIN_FILE_SIZE_BYTESfor more details.- Parameters:
- minFileSizeBytes- min file size
 
- 
maxFileSizeBytesConfigures the max file size considered for rewriting. SeeSizeBasedFileRewritePlanner.MAX_FILE_SIZE_BYTESfor more details.- Parameters:
- maxFileSizeBytes- max file size
 
- 
minInputFilesConfigures the minimum file number after a rewrite is always initiated. See description seeSizeBasedFileRewritePlanner.MIN_INPUT_FILESfor more details.- Parameters:
- minInputFiles- min file number
 
- 
deleteFileThresholdConfigures the minimum delete file number for a file after a rewrite is always initiated. SeeBinPackRewriteFilePlanner.DELETE_FILE_THRESHOLDfor more details.- Parameters:
- deleteFileThreshold- min delete file number
 
- 
rewriteAllOverrides other options and forces rewriting of all provided files.- Parameters:
- rewriteAll- enables a full rewrite
 
- 
maxFileGroupSizeBytesConfigures the group size for rewriting. SeeSizeBasedFileRewritePlanner.MAX_FILE_GROUP_SIZE_BYTESfor more details.- Parameters:
- maxFileGroupSizeBytes- file group size for rewrite
 
- 
maxFilesToRewriteConfigures max files to rewrite. SeeBinPackRewriteFilePlanner.MAX_FILES_TO_REWRITEfor more details.- Parameters:
- maxFilesToRewrite- maximum files to rewrite
 
- 
filterA user provided filter for determining which files will be considered by the rewrite strategy.- Parameters:
- newFilter- the filter expression to apply
- Returns:
- this for method chaining
 
- 
configConfigures the properties for the rewriter.- Parameters:
- rewriteDataFilesConfig- properties for the rewriter
 
 
-