Package org.apache.iceberg.spark.actions
Class BaseRewriteDataFilesSpark3Action
- java.lang.Object
- 
- org.apache.iceberg.spark.actions.BaseRewriteDataFilesSpark3Action
 
- 
- All Implemented Interfaces:
- Action<RewriteDataFiles,RewriteDataFiles.Result>,- RewriteDataFiles,- SnapshotUpdate<RewriteDataFiles,RewriteDataFiles.Result>
 
 public class BaseRewriteDataFilesSpark3Action extends java.lang.Object
- 
- 
Nested Class Summary- 
Nested classes/interfaces inherited from interface org.apache.iceberg.actions.RewriteDataFilesRewriteDataFiles.FileGroupInfo, RewriteDataFiles.FileGroupRewriteResult, RewriteDataFiles.Result
 
- 
 - 
Field Summary- 
Fields inherited from interface org.apache.iceberg.actions.RewriteDataFilesMAX_CONCURRENT_FILE_GROUP_REWRITES, MAX_CONCURRENT_FILE_GROUP_REWRITES_DEFAULT, MAX_FILE_GROUP_SIZE_BYTES, MAX_FILE_GROUP_SIZE_BYTES_DEFAULT, PARTIAL_PROGRESS_ENABLED, PARTIAL_PROGRESS_ENABLED_DEFAULT, PARTIAL_PROGRESS_MAX_COMMITS, PARTIAL_PROGRESS_MAX_COMMITS_DEFAULT, TARGET_FILE_SIZE_BYTES
 
- 
 - 
Constructor SummaryConstructors Modifier Constructor Description protectedBaseRewriteDataFilesSpark3Action(org.apache.spark.sql.SparkSession spark, Table table)
 - 
Method SummaryAll Methods Instance Methods Concrete Methods Modifier and Type Method Description RewriteDataFilesbinPack()Choose BINPACK as a strategy for this rewrite operationprotected BinPackStrategybinPackStrategy()The framework specificBinPackStrategyprotected org.apache.spark.sql.Dataset<org.apache.spark.sql.Row>buildManifestFileDF(Table table)protected org.apache.spark.sql.Dataset<org.apache.spark.sql.Row>buildManifestListDF(Table table)protected org.apache.spark.sql.Dataset<org.apache.spark.sql.Row>buildOtherMetadataFileDF(Table table)protected org.apache.spark.sql.Dataset<org.apache.spark.sql.Row>buildValidDataFileDF(Table table)protected org.apache.spark.sql.Dataset<org.apache.spark.sql.Row>buildValidMetadataFileDF(Table table)protected voidcommit(SnapshotUpdate<?> update)RewriteDataFiles.Resultexecute()Executes this action.RewriteDataFilesfilter(Expression expression)A user provided filter for determining which files will be considered by the rewrite strategy.protected org.apache.spark.sql.Dataset<org.apache.spark.sql.Row>loadMetadataTable(Table table, MetadataTableType type)protected JobGroupInfonewJobGroupInfo(java.lang.String groupId, java.lang.String desc)protected TablenewStaticTable(TableMetadata metadata, FileIO io)ThisToption(java.lang.String name, java.lang.String value)Configures this action with an extra option.protected java.util.Map<java.lang.String,java.lang.String>options()ThisToptions(java.util.Map<java.lang.String,java.lang.String> newOptions)Configures this action with extra options.protected RewriteDataFilesself()ThisTsnapshotProperty(java.lang.String property, java.lang.String value)Sets a summary property in the snapshot produced by this action.protected org.apache.spark.sql.SparkSessionspark()protected org.apache.spark.api.java.JavaSparkContextsparkContext()protected Tabletable()protected <T> TwithJobGroupInfo(JobGroupInfo info, java.util.function.Supplier<T> supplier)- 
Methods inherited from class java.lang.Objectclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 - 
Methods inherited from interface org.apache.iceberg.actions.SnapshotUpdatesnapshotProperty
 
- 
 
- 
- 
- 
Constructor Detail- 
BaseRewriteDataFilesSpark3Actionprotected BaseRewriteDataFilesSpark3Action(org.apache.spark.sql.SparkSession spark, Table table)
 
- 
 - 
Method Detail- 
binPackStrategyprotected BinPackStrategy binPackStrategy() The framework specificBinPackStrategy
 - 
selfprotected RewriteDataFiles self() 
 - 
tableprotected Table table() 
 - 
binPackpublic RewriteDataFiles binPack() Description copied from interface:RewriteDataFilesChoose BINPACK as a strategy for this rewrite operation- Specified by:
- binPackin interface- RewriteDataFiles
- Returns:
- this for method chaining
 
 - 
filterpublic RewriteDataFiles filter(Expression expression) Description copied from interface:RewriteDataFilesA user provided filter for determining which files will be considered by the rewrite strategy. This will be used in addition to whatever rules the rewrite strategy generates. For example this would be used for providing a restriction to only run rewrite on a specific partition.- Specified by:
- filterin interface- RewriteDataFiles
- Parameters:
- expression- An iceberg expression used to determine which files will be considered for rewriting
- Returns:
- this for chaining
 
 - 
executepublic RewriteDataFiles.Result execute() Description copied from interface:ActionExecutes this action.- Specified by:
- executein interface- Action<RewriteDataFiles,RewriteDataFiles.Result>
- Returns:
- the result of this action
 
 - 
snapshotPropertypublic ThisT snapshotProperty(java.lang.String property, java.lang.String value)Description copied from interface:SnapshotUpdateSets a summary property in the snapshot produced by this action.- Specified by:
- snapshotPropertyin interface- SnapshotUpdate<ThisT,R>
- Parameters:
- property- a snapshot property name
- value- a snapshot property value
- Returns:
- this for method chaining
 
 - 
commitprotected void commit(SnapshotUpdate<?> update) 
 - 
sparkprotected org.apache.spark.sql.SparkSession spark() 
 - 
sparkContextprotected org.apache.spark.api.java.JavaSparkContext sparkContext() 
 - 
optionpublic ThisT option(java.lang.String name, java.lang.String value)Description copied from interface:ActionConfigures this action with an extra option.Certain actions allow users to control internal details of their execution via options. 
 - 
optionspublic ThisT options(java.util.Map<java.lang.String,java.lang.String> newOptions) Description copied from interface:ActionConfigures this action with extra options.Certain actions allow users to control internal details of their execution via options. 
 - 
optionsprotected java.util.Map<java.lang.String,java.lang.String> options() 
 - 
withJobGroupInfoprotected <T> T withJobGroupInfo(JobGroupInfo info, java.util.function.Supplier<T> supplier) 
 - 
newJobGroupInfoprotected JobGroupInfo newJobGroupInfo(java.lang.String groupId, java.lang.String desc) 
 - 
newStaticTableprotected Table newStaticTable(TableMetadata metadata, FileIO io) 
 - 
buildValidDataFileDFprotected org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> buildValidDataFileDF(Table table) 
 - 
buildManifestFileDFprotected org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> buildManifestFileDF(Table table) 
 - 
buildManifestListDFprotected org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> buildManifestListDF(Table table) 
 - 
buildOtherMetadataFileDFprotected org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> buildOtherMetadataFileDF(Table table) 
 - 
buildValidMetadataFileDFprotected org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> buildValidMetadataFileDF(Table table) 
 - 
loadMetadataTableprotected org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> loadMetadataTable(Table table, MetadataTableType type) 
 
- 
 
-