public class SparkDataFile extends SparkContentFile<DataFile> implements DataFile
COLUMN_SIZES, CONTENT, EQUALITY_IDS, FILE_FORMAT, FILE_PATH, FILE_SIZE, KEY_METADATA, LOWER_BOUNDS, NAN_VALUE_COUNTS, NULL_VALUE_COUNTS, PARTITION_DOC, PARTITION_ID, PARTITION_NAME, RECORD_COUNT, SORT_ORDER_ID, SPEC_ID, SPLIT_OFFSETS, UPPER_BOUNDS, VALUE_COUNTS| Constructor and Description |
|---|
SparkDataFile(Types.StructType type,
org.apache.spark.sql.types.StructType sparkType) |
SparkDataFile(Types.StructType type,
Types.StructType projectedType,
org.apache.spark.sql.types.StructType sparkType) |
| Modifier and Type | Method and Description |
|---|---|
protected DataFile |
asFile() |
java.util.List<java.lang.Integer> |
equalityFieldIds()
Returns the set of field IDs used for equality comparison, in equality delete files.
|
columnSizes, content, copy, copyWithoutStats, fileSizeInBytes, format, keyMetadata, lowerBounds, nanValueCounts, nullValueCounts, partition, path, pos, recordCount, sortOrderId, specId, splitOffsets, upperBounds, valueCounts, wrapclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitcolumnSizes, copy, copy, copyWithoutStats, copyWithStats, dataSequenceNumber, fileSequenceNumber, fileSizeInBytes, format, keyMetadata, lowerBounds, nanValueCounts, nullValueCounts, partition, path, pos, recordCount, sortOrderId, specId, splitOffsets, upperBounds, valueCountspublic SparkDataFile(Types.StructType type, org.apache.spark.sql.types.StructType sparkType)
public SparkDataFile(Types.StructType type, Types.StructType projectedType, org.apache.spark.sql.types.StructType sparkType)
protected DataFile asFile()
asFile in class SparkContentFile<DataFile>public java.util.List<java.lang.Integer> equalityFieldIds()
ContentFileAn equality delete file may contain additional data fields that are not used by equality comparison. The subset of columns in a delete file to be used in equality comparison are tracked by ID. Extra columns can be used to reconstruct changes and metrics from extra columns are used during job planning.
equalityFieldIds in interface ContentFile<DataFile>equalityFieldIds in interface DataFileequalityFieldIds in class SparkContentFile<DataFile>