Package org.apache.iceberg.spark
Class SparkDataFile
- All Implemented Interfaces:
ContentFile<DataFile>, DataFile
-
Field Summary
Fields inherited from interface org.apache.iceberg.DataFile
COLUMN_SIZES, CONTENT, EQUALITY_IDS, FILE_FORMAT, FILE_PATH, FILE_SIZE, KEY_METADATA, LOWER_BOUNDS, NAN_VALUE_COUNTS, NULL_VALUE_COUNTS, PARTITION_DOC, PARTITION_ID, PARTITION_NAME, RECORD_COUNT, SORT_ORDER_ID, SPEC_ID, SPLIT_OFFSETS, UPPER_BOUNDS, VALUE_COUNTS
-
Constructor Summary
Constructor    Description
SparkDataFile(Types.StructType type, Types.StructType projectedType, org.apache.spark.sql.types.StructType sparkType)
SparkDataFile(Types.StructType type, org.apache.spark.sql.types.StructType sparkType)
-
Method Summary
Methods inherited from class org.apache.iceberg.spark.SparkContentFile
columnSizes, content, copy, copyWithoutStats, fileSizeInBytes, format, keyMetadata, lowerBounds, nanValueCounts, nullValueCounts, partition, path, pos, recordCount, sortOrderId, specId, splitOffsets, upperBounds, valueCounts, wrap
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface org.apache.iceberg.ContentFile
columnSizes, copy, copy, copyWithoutStats, copyWithStats, dataSequenceNumber, fileSequenceNumber, fileSizeInBytes, format, keyMetadata, location, lowerBounds, manifestLocation, nanValueCounts, nullValueCounts, partition, path, pos, recordCount, sortOrderId, specId, splitOffsets, upperBounds, valueCounts
-
Constructor Details
-
SparkDataFile
public SparkDataFile(Types.StructType type, org.apache.spark.sql.types.StructType sparkType)
-
SparkDataFile
public SparkDataFile(Types.StructType type, Types.StructType projectedType, org.apache.spark.sql.types.StructType sparkType)
-
-
Method Details
-
asFile
- Specified by:
asFile in class SparkContentFile<DataFile>
-
equalityFieldIds
Description copied from interface: ContentFile
Returns the set of field IDs used for equality comparison, in equality delete files.
An equality delete file may contain additional data fields that are not used by equality comparison. The subset of columns in a delete file to be used in equality comparison is tracked by ID. Extra columns can be used to reconstruct changes, and metrics from extra columns are used during job planning.
- Specified by:
equalityFieldIds in interface ContentFile<DataFile>
- Specified by:
equalityFieldIds in interface DataFile
- Overrides:
equalityFieldIds in class SparkContentFile<DataFile>
- Returns:
- IDs of the fields used in equality comparison with the records in this delete file
-
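The description of equalityFieldIds says that only a subset of a delete file's columns, tracked by field ID, takes part in equality comparison. The following is a minimal sketch of that idea and does not use the Iceberg API: the class EqualityDeleteSketch, the method isDeleted, and the modeling of rows as maps from field ID to value are all hypothetical, chosen only for illustration.

```java
import java.util.List;
import java.util.Map;
import java.util.Objects;

public class EqualityDeleteSketch {

    /**
     * Hypothetical helper (not part of Iceberg): returns true when the data row
     * matches the delete row on every field listed in equalityFieldIds.
     * Rows are modeled as maps from field ID to value.
     */
    public static boolean isDeleted(Map<Integer, Object> dataRow,
                                    Map<Integer, Object> deleteRow,
                                    List<Integer> equalityFieldIds) {
        if (equalityFieldIds.isEmpty()) {
            // Assumption of this sketch: an equality delete needs at least one field ID.
            throw new IllegalArgumentException("equalityFieldIds must not be empty");
        }
        for (int fieldId : equalityFieldIds) {
            // Only the listed field IDs are compared; extra columns are ignored.
            if (!Objects.equals(dataRow.get(fieldId), deleteRow.get(fieldId))) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Field 1 = id, field 2 = name, field 3 = an extra column carried by the delete row.
        Map<Integer, Object> dataRow = Map.of(1, 42L, 2, "alice");
        Map<Integer, Object> deleteRow = Map.of(1, 42L, 3, "extra column, not compared");
        Map<Integer, Object> otherRow = Map.of(1, 7L, 2, "bob");

        List<Integer> equalityFieldIds = List.of(1);

        System.out.println(isDeleted(dataRow, deleteRow, equalityFieldIds));  // true
        System.out.println(isDeleted(otherRow, deleteRow, equalityFieldIds)); // false
    }
}
```

In this sketch the extra column with field ID 3 never affects the match, which mirrors the statement above that a delete file may carry fields that are not used by equality comparison.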