Package org.apache.iceberg.actions
Class SizeBasedDataRewriter
- java.lang.Object
- 
- org.apache.iceberg.actions.SizeBasedFileRewriter<FileScanTask,DataFile>
- 
- org.apache.iceberg.actions.SizeBasedDataRewriter
 
 
- 
- All Implemented Interfaces:
- FileRewriter<FileScanTask,DataFile>
 
 public abstract class SizeBasedDataRewriter extends SizeBasedFileRewriter<FileScanTask,DataFile> 
- 
- 
Field SummaryFields Modifier and Type Field Description static java.lang.StringDELETE_FILE_THRESHOLDThe minimum number of deletes that needs to be associated with a data file for it to be considered for rewriting.static intDELETE_FILE_THRESHOLD_DEFAULT- 
Fields inherited from class org.apache.iceberg.actions.SizeBasedFileRewriterMAX_FILE_GROUP_SIZE_BYTES, MAX_FILE_GROUP_SIZE_BYTES_DEFAULT, MAX_FILE_SIZE_BYTES, MAX_FILE_SIZE_DEFAULT_RATIO, MIN_FILE_SIZE_BYTES, MIN_FILE_SIZE_DEFAULT_RATIO, MIN_INPUT_FILES, MIN_INPUT_FILES_DEFAULT, REWRITE_ALL, REWRITE_ALL_DEFAULT, TARGET_FILE_SIZE_BYTES
 
- 
 - 
Constructor SummaryConstructors Modifier Constructor Description protectedSizeBasedDataRewriter(Table table)
 - 
Method SummaryAll Methods Instance Methods Concrete Methods Modifier and Type Method Description protected longdefaultTargetFileSize()protected java.lang.Iterable<java.util.List<FileScanTask>>filterFileGroups(java.util.List<java.util.List<FileScanTask>> groups)protected java.lang.Iterable<FileScanTask>filterFiles(java.lang.Iterable<FileScanTask> tasks)voidinit(java.util.Map<java.lang.String,java.lang.String> options)Initializes this rewriter using provided options.java.util.Set<java.lang.String>validOptions()Returns a set of supported options for this rewriter.- 
Methods inherited from class org.apache.iceberg.actions.SizeBasedFileRewriterenoughContent, enoughInputFiles, inputSize, numOutputFiles, planFileGroups, splitSize, table, tooMuchContent, writeMaxFileSize, wronglySized
 - 
Methods inherited from class java.lang.Objectclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 - 
Methods inherited from interface org.apache.iceberg.actions.FileRewriterdescription, rewrite
 
- 
 
- 
- 
- 
Field Detail- 
DELETE_FILE_THRESHOLDpublic static final java.lang.String DELETE_FILE_THRESHOLD The minimum number of deletes that needs to be associated with a data file for it to be considered for rewriting. If a data file has this number of deletes or more, it will be rewritten regardless of its file size determined bySizeBasedFileRewriter.MIN_FILE_SIZE_BYTESandSizeBasedFileRewriter.MAX_FILE_SIZE_BYTES. If a file group contains a file that satisfies this condition, the file group will be rewritten regardless of the number of files in the file group determined bySizeBasedFileRewriter.MIN_INPUT_FILES.Defaults to Integer.MAX_VALUE, which means this feature is not enabled by default. - See Also:
- Constant Field Values
 
 - 
DELETE_FILE_THRESHOLD_DEFAULTpublic static final int DELETE_FILE_THRESHOLD_DEFAULT - See Also:
- Constant Field Values
 
 
- 
 - 
Constructor Detail- 
SizeBasedDataRewriterprotected SizeBasedDataRewriter(Table table) 
 
- 
 - 
Method Detail- 
validOptionspublic java.util.Set<java.lang.String> validOptions() Description copied from interface:FileRewriterReturns a set of supported options for this rewriter. Only options specified in this list will be accepted at runtime. Any other options will be rejected.- Specified by:
- validOptionsin interface- FileRewriter<FileScanTask,DataFile>
- Overrides:
- validOptionsin class- SizeBasedFileRewriter<FileScanTask,DataFile>
 
 - 
initpublic void init(java.util.Map<java.lang.String,java.lang.String> options) Description copied from interface:FileRewriterInitializes this rewriter using provided options.- Specified by:
- initin interface- FileRewriter<FileScanTask,DataFile>
- Overrides:
- initin class- SizeBasedFileRewriter<FileScanTask,DataFile>
- Parameters:
- options- options to initialize this rewriter
 
 - 
filterFilesprotected java.lang.Iterable<FileScanTask> filterFiles(java.lang.Iterable<FileScanTask> tasks) - Specified by:
- filterFilesin class- SizeBasedFileRewriter<FileScanTask,DataFile>
 
 - 
filterFileGroupsprotected java.lang.Iterable<java.util.List<FileScanTask>> filterFileGroups(java.util.List<java.util.List<FileScanTask>> groups) - Specified by:
- filterFileGroupsin class- SizeBasedFileRewriter<FileScanTask,DataFile>
 
 - 
defaultTargetFileSizeprotected long defaultTargetFileSize() - Specified by:
- defaultTargetFileSizein class- SizeBasedFileRewriter<FileScanTask,DataFile>
 
 
- 
 
-