Package org.apache.iceberg
Interface DataFile
-
- All Superinterfaces:
ContentFile<DataFile>
- All Known Implementing Classes:
SparkDataFile
public interface DataFile extends ContentFile<DataFile>
Interface for data files listed in a table manifest.
-
-
Field Summary
Fields Modifier and Type Field Description static Types.NestedFieldCOLUMN_SIZESstatic Types.NestedFieldCONTENTstatic Types.NestedFieldEQUALITY_IDSstatic Types.NestedFieldFILE_FORMATstatic Types.NestedFieldFILE_PATHstatic Types.NestedFieldFILE_SIZEstatic Types.NestedFieldKEY_METADATAstatic Types.NestedFieldLOWER_BOUNDSstatic Types.NestedFieldNAN_VALUE_COUNTSstatic Types.NestedFieldNULL_VALUE_COUNTSstatic java.lang.StringPARTITION_DOCstatic intPARTITION_IDstatic java.lang.StringPARTITION_NAMEstatic Types.NestedFieldRECORD_COUNTstatic Types.NestedFieldSORT_ORDER_IDstatic Types.NestedFieldSPEC_IDstatic Types.NestedFieldSPLIT_OFFSETSstatic Types.NestedFieldUPPER_BOUNDSstatic Types.NestedFieldVALUE_COUNTS
-
Method Summary
All Methods Static Methods Instance Methods Default Methods Modifier and Type Method Description default FileContentcontent()Returns type of content stored in the file; one of DATA, POSITION_DELETES, or EQUALITY_DELETES.default java.util.List<java.lang.Integer>equalityFieldIds()Returns the set of field IDs used for equality comparison, in equality delete files.static Types.StructTypegetType(Types.StructType partitionType)-
Methods inherited from interface org.apache.iceberg.ContentFile
columnSizes, copy, copy, copyWithoutStats, fileSizeInBytes, format, keyMetadata, lowerBounds, nanValueCounts, nullValueCounts, partition, path, pos, recordCount, sortOrderId, specId, splitOffsets, upperBounds, valueCounts
-
-
-
-
Field Detail
-
CONTENT
static final Types.NestedField CONTENT
-
FILE_PATH
static final Types.NestedField FILE_PATH
-
FILE_FORMAT
static final Types.NestedField FILE_FORMAT
-
RECORD_COUNT
static final Types.NestedField RECORD_COUNT
-
FILE_SIZE
static final Types.NestedField FILE_SIZE
-
COLUMN_SIZES
static final Types.NestedField COLUMN_SIZES
-
VALUE_COUNTS
static final Types.NestedField VALUE_COUNTS
-
NULL_VALUE_COUNTS
static final Types.NestedField NULL_VALUE_COUNTS
-
NAN_VALUE_COUNTS
static final Types.NestedField NAN_VALUE_COUNTS
-
LOWER_BOUNDS
static final Types.NestedField LOWER_BOUNDS
-
UPPER_BOUNDS
static final Types.NestedField UPPER_BOUNDS
-
KEY_METADATA
static final Types.NestedField KEY_METADATA
-
SPLIT_OFFSETS
static final Types.NestedField SPLIT_OFFSETS
-
EQUALITY_IDS
static final Types.NestedField EQUALITY_IDS
-
SORT_ORDER_ID
static final Types.NestedField SORT_ORDER_ID
-
SPEC_ID
static final Types.NestedField SPEC_ID
-
PARTITION_ID
static final int PARTITION_ID
- See Also:
- Constant Field Values
-
PARTITION_NAME
static final java.lang.String PARTITION_NAME
- See Also:
- Constant Field Values
-
PARTITION_DOC
static final java.lang.String PARTITION_DOC
- See Also:
- Constant Field Values
-
-
Method Detail
-
getType
static Types.StructType getType(Types.StructType partitionType)
-
content
default FileContent content()
Description copied from interface:ContentFileReturns type of content stored in the file; one of DATA, POSITION_DELETES, or EQUALITY_DELETES.- Specified by:
contentin interfaceContentFile<DataFile>- Returns:
- the content stored in the file; one of DATA, POSITION_DELETES, or EQUALITY_DELETES
-
equalityFieldIds
default java.util.List<java.lang.Integer> equalityFieldIds()
Description copied from interface:ContentFileReturns the set of field IDs used for equality comparison, in equality delete files.An equality delete file may contain additional data fields that are not used by equality comparison. The subset of columns in a delete file to be used in equality comparison are tracked by ID. Extra columns can be used to reconstruct changes and metrics from extra columns are used during job planning.
- Specified by:
equalityFieldIdsin interfaceContentFile<DataFile>- Returns:
- IDs of the fields used in equality comparison with the records in this delete file
-
-