Class MaintenanceTaskBuilder<T extends MaintenanceTaskBuilder<?>>
java.lang.Object
org.apache.iceberg.flink.maintenance.api.MaintenanceTaskBuilder<T>
- Direct Known Subclasses:
- DeleteOrphanFiles.Builder,- ExpireSnapshots.Builder,- RewriteDataFiles.Builder
@Experimental
public abstract class MaintenanceTaskBuilder<T extends MaintenanceTaskBuilder<?>>
extends Object
- 
Constructor SummaryConstructors
- 
Method SummaryModifier and TypeMethodDescriptionprotected intindex()protected StringoperatorName(String operatorNameBase) protected Integerparallelism(int newParallelism) Sets the parallelism for the stream.scheduleOnCommitCount(int commitCount) After a given number of Iceberg table commits since the last run, starts the downstream job.scheduleOnDataFileCount(int dataFileCount) After a given number of new data files since the last run, starts the downstream job.scheduleOnDataFileSize(long dataFileSizeInBytes) After a given aggregated data file size since the last run, starts the downstream job.scheduleOnEqDeleteFileCount(int eqDeleteFileCount) After a given number of new equality delete files since the last run, starts the downstream job.scheduleOnEqDeleteRecordCount(long eqDeleteRecordCount) After a given number of new equality delete records since the last run, starts the downstream job.scheduleOnInterval(Duration interval) After a given time since the last run, starts the downstream job.scheduleOnPosDeleteFileCount(int posDeleteFileCount) After a given number of new positional delete files since the last run, starts the downstream job.scheduleOnPosDeleteRecordCount(long posDeleteRecordCount) After a given number of new positional delete records since the last run, starts the downstream job.protected StringslotSharingGroup(String newSlotSharingGroup) TheSingleOutputStreamOperator.slotSharingGroup(String)for all the operators of the generated stream.protected TableLoaderprotected Stringprotected StringtaskName()protected StringThe suffix used for the generatedTransformation's uid.
- 
Constructor Details- 
MaintenanceTaskBuilderpublic MaintenanceTaskBuilder()
 
- 
- 
Method Details- 
scheduleOnCommitCountAfter a given number of Iceberg table commits since the last run, starts the downstream job.- Parameters:
- commitCount- after the downstream job should be started
 
- 
scheduleOnDataFileCountAfter a given number of new data files since the last run, starts the downstream job.- Parameters:
- dataFileCount- after the downstream job should be started
 
- 
scheduleOnDataFileSizeAfter a given aggregated data file size since the last run, starts the downstream job.- Parameters:
- dataFileSizeInBytes- after the downstream job should be started
 
- 
scheduleOnPosDeleteFileCountAfter a given number of new positional delete files since the last run, starts the downstream job.- Parameters:
- posDeleteFileCount- after the downstream job should be started
 
- 
scheduleOnPosDeleteRecordCountAfter a given number of new positional delete records since the last run, starts the downstream job.- Parameters:
- posDeleteRecordCount- after the downstream job should be started
 
- 
scheduleOnEqDeleteFileCountAfter a given number of new equality delete files since the last run, starts the downstream job.- Parameters:
- eqDeleteFileCount- after the downstream job should be started
 
- 
scheduleOnEqDeleteRecordCountAfter a given number of new equality delete records since the last run, starts the downstream job.- Parameters:
- eqDeleteRecordCount- after the downstream job should be started
 
- 
scheduleOnIntervalAfter a given time since the last run, starts the downstream job.- Parameters:
- interval- after the downstream job should be started
 
- 
uidSuffixThe suffix used for the generatedTransformation's uid.- Parameters:
- newUidSuffix- for the transformations
 
- 
slotSharingGroupTheSingleOutputStreamOperator.slotSharingGroup(String)for all the operators of the generated stream. Could be used to separate the resources used by this task.- Parameters:
- newSlotSharingGroup- to be used for the operators
 
- 
parallelismSets the parallelism for the stream.- Parameters:
- newParallelism- the required parallelism
 
- 
indexprotected int index()
- 
taskName
- 
tableName
- 
tableLoader
- 
uidSuffix
- 
slotSharingGroup
- 
parallelism
- 
operatorName
 
-