public class SparkDataFile extends SparkContentFile<DataFile> implements DataFile
COLUMN_SIZES, CONTENT, EQUALITY_IDS, FILE_FORMAT, FILE_PATH, FILE_SIZE, KEY_METADATA, LOWER_BOUNDS, NAN_VALUE_COUNTS, NULL_VALUE_COUNTS, PARTITION_DOC, PARTITION_ID, PARTITION_NAME, RECORD_COUNT, SORT_ORDER_ID, SPEC_ID, SPLIT_OFFSETS, UPPER_BOUNDS, VALUE_COUNTS
Constructor and Description |
---|
SparkDataFile(Types.StructType type,
org.apache.spark.sql.types.StructType sparkType) |
SparkDataFile(Types.StructType type,
Types.StructType projectedType,
org.apache.spark.sql.types.StructType sparkType) |
Modifier and Type | Method and Description |
---|---|
protected DataFile |
asFile() |
java.util.List<java.lang.Integer> |
equalityFieldIds()
Returns the set of field IDs used for equality comparison, in equality delete files.
|
columnSizes, content, copy, copyWithoutStats, fileSizeInBytes, format, keyMetadata, lowerBounds, nanValueCounts, nullValueCounts, partition, path, pos, recordCount, sortOrderId, specId, splitOffsets, upperBounds, valueCounts, wrap
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
columnSizes, copy, copy, copyWithoutStats, copyWithStats, dataSequenceNumber, fileSequenceNumber, fileSizeInBytes, format, keyMetadata, lowerBounds, nanValueCounts, nullValueCounts, partition, path, pos, recordCount, sortOrderId, specId, splitOffsets, upperBounds, valueCounts
public SparkDataFile(Types.StructType type, org.apache.spark.sql.types.StructType sparkType)
public SparkDataFile(Types.StructType type, Types.StructType projectedType, org.apache.spark.sql.types.StructType sparkType)
protected DataFile asFile()
asFile
in class SparkContentFile<DataFile>
public java.util.List<java.lang.Integer> equalityFieldIds()
ContentFile
An equality delete file may contain additional data fields that are not used by equality comparison. The subset of columns in a delete file to be used in equality comparison are tracked by ID. Extra columns can be used to reconstruct changes and metrics from extra columns are used during job planning.
equalityFieldIds
in interface ContentFile<DataFile>
equalityFieldIds
in interface DataFile
equalityFieldIds
in class SparkContentFile<DataFile>