Class Spark3SortStrategy

    • Field Detail

      • COMPRESSION_FACTOR

        public static final java.lang.String COMPRESSION_FACTOR
        The number of shuffle partitions, and consequently the number of output files created by the Spark sort, is based on the size of the input data files used in this rewrite operation. Because of compression, the on-disk file sizes may not accurately represent the size of the output files. This parameter lets the user adjust the file size used when estimating the actual output data size. A factor greater than 1.0 generates more files than the on-disk file size alone would suggest; a value less than 1.0 generates fewer. A configuration sketch is shown below.
        See Also:
        Constant Field Values
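
        A minimal sketch of passing this option when constructing the strategy. It relies on the options(Map) call from the RewriteStrategy interface; the import paths and the use of Map.of are assumptions and may need adjusting for your Iceberg and Java versions.

          import java.util.Map;

          import org.apache.iceberg.Table;
          import org.apache.iceberg.actions.RewriteStrategy;
          import org.apache.iceberg.spark.actions.Spark3SortStrategy;  // assumed package
          import org.apache.spark.sql.SparkSession;

          class CompressionFactorExample {
            RewriteStrategy configure(Table table, SparkSession spark) {
              // "1.5" asks the strategy to treat the data as roughly 1.5x larger than
              // its compressed on-disk size when estimating the output size, which
              // yields more output files than the raw file sizes alone would suggest.
              return new Spark3SortStrategy(table, spark)
                  .options(Map.of(Spark3SortStrategy.COMPRESSION_FACTOR, "1.5"));
            }
          }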
    • Constructor Detail

      • Spark3SortStrategy

        public Spark3SortStrategy​(Table table,
                                  org.apache.spark.sql.SparkSession spark)
    • Method Detail

      • table

        public Table table()
        Description copied from interface: RewriteStrategy
        Returns the table being modified by this rewrite strategy.
      • validOptions

        public java.util.Set<java.lang.String> validOptions()
        Description copied from interface: RewriteStrategy
        Returns the set of options which this rewrite strategy can use. This is an allow-list; any option not specified here will be rejected at runtime. A validation sketch is shown below.
        Specified by:
        validOptions in interface RewriteStrategy
        Overrides:
        validOptions in class SortStrategy
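
        A minimal sketch of how a caller might use this allow-list to validate user-supplied options up front. The name() and options(Map) calls come from the RewriteStrategy interface; the import paths are assumptions.

          import java.util.Map;
          import java.util.Set;

          import org.apache.iceberg.Table;
          import org.apache.iceberg.actions.RewriteStrategy;
          import org.apache.iceberg.spark.actions.Spark3SortStrategy;  // assumed package
          import org.apache.spark.sql.SparkSession;

          class OptionValidationExample {
            RewriteStrategy applyOptions(Table table, SparkSession spark, Map<String, String> userOptions) {
              RewriteStrategy strategy = new Spark3SortStrategy(table, spark);
              // Reject anything outside the strategy's allow-list before applying it,
              // mirroring the check the strategy performs at runtime.
              Set<String> allowed = strategy.validOptions();
              for (String key : userOptions.keySet()) {
                if (!allowed.contains(key)) {
                  throw new IllegalArgumentException(
                      "Option not supported by " + strategy.name() + ": " + key);
                }
              }
              return strategy.options(userOptions);
            }
          }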
      • rewriteFiles

        public java.util.Set<DataFile> rewriteFiles​(java.util.List<FileScanTask> filesToRewrite)
        Description copied from interface: RewriteStrategy
        Rewrites files according to this particular RewriteStrategy's algorithm. The implementation is typically specific to the action framework in use (Spark, Presto, Flink, ...). A usage sketch is shown below.
        Parameters:
        filesToRewrite - a group of files to be rewritten together
        Returns:
        a set of newly written files
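
        A usage sketch for a single file group, assuming the table has a sort order defined and using standard Table APIs (newScan().planFiles(), newRewrite()) from the same Iceberg release; in practice the rewrite action handles planning, grouping, and commit coordination. The Spark3SortStrategy import path is an assumption.

          import java.io.IOException;
          import java.util.ArrayList;
          import java.util.HashSet;
          import java.util.List;
          import java.util.Set;

          import org.apache.iceberg.DataFile;
          import org.apache.iceberg.FileScanTask;
          import org.apache.iceberg.Table;
          import org.apache.iceberg.actions.RewriteStrategy;
          import org.apache.iceberg.io.CloseableIterable;
          import org.apache.iceberg.spark.actions.Spark3SortStrategy;  // assumed package
          import org.apache.spark.sql.SparkSession;

          class RewriteFilesExample {
            void rewriteOneGroup(Table table, SparkSession spark) throws IOException {
              RewriteStrategy strategy = new Spark3SortStrategy(table, spark);

              // Collect one group of tasks to rewrite together (here: all current files).
              List<FileScanTask> group = new ArrayList<>();
              try (CloseableIterable<FileScanTask> tasks = table.newScan().planFiles()) {
                tasks.forEach(group::add);
              }

              // Sort-rewrite the group in Spark; the strategy returns the files it wrote.
              Set<DataFile> addedFiles = strategy.rewriteFiles(group);

              // Replace the old files with the new ones in a single rewrite commit.
              Set<DataFile> replacedFiles = new HashSet<>();
              for (FileScanTask task : group) {
                replacedFiles.add(task.file());
              }
              table.newRewrite().rewriteFiles(replacedFiles, addedFiles).commit();
            }
          }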
      • spark

        protected org.apache.spark.sql.SparkSession spark()
      • sortPlan

        protected org.apache.spark.sql.catalyst.plans.logical.LogicalPlan sortPlan​(org.apache.spark.sql.connector.distributions.Distribution distribution,
                                                                                   org.apache.spark.sql.connector.expressions.SortOrder[] ordering,
                                                                                   org.apache.spark.sql.catalyst.plans.logical.LogicalPlan plan,
                                                                                   org.apache.spark.sql.internal.SQLConf conf)