Package org.apache.iceberg.spark
Class SparkContentFile<F>
java.lang.Object
org.apache.iceberg.spark.SparkContentFile<F>
- All Implemented Interfaces:
- ContentFile<F>
- Direct Known Subclasses:
- SparkDataFile, SparkDeleteFile
- 
Method Summary
Modifier and Type, Method, Description
- protected abstract F asFile()
- columnSizes(): Returns if collected, map from column ID to the size of the column in bytes, null otherwise.
- content(): Returns type of content stored in the file; one of DATA, POSITION_DELETES, or EQUALITY_DELETES.
- copy(): Copies this file.
- copyWithoutStats(): Copies this file without file stats.
- equalityFieldIds(): Returns the set of field IDs used for equality comparison, in equality delete files.
- long fileSizeInBytes(): Returns the file size in bytes.
- format(): Returns format of the file.
- keyMetadata(): Returns metadata about how this file is encrypted, or null if the file is stored in plain text.
- lowerBounds(): Returns if collected, map from column ID to value lower bounds, null otherwise.
- nanValueCounts(): Returns if collected, map from column ID to its NaN value count, null otherwise.
- nullValueCounts(): Returns if collected, map from column ID to its null value count, null otherwise.
- partition(): Returns partition for this file as a StructLike.
- path(): Returns fully qualified path to the file, suitable for constructing a Hadoop Path.
- pos(): Returns the ordinal position of the file in a manifest, or null if it was not read from a manifest.
- long recordCount(): Returns the number of top-level records in the file.
- sortOrderId(): Returns the sort order id of this file, which describes how the file is ordered.
- int specId(): Returns id of the partition spec used for partition metadata.
- splitOffsets(): Returns list of recommended split locations, if applicable, null otherwise.
- upperBounds(): Returns if collected, map from column ID to value upper bounds, null otherwise.
- valueCounts(): Returns if collected, map from column ID to the count of its values (including null and NaN values), null otherwise.
- wrap(org.apache.spark.sql.Row row)
Methods inherited from class java.lang.Object
- clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface org.apache.iceberg.ContentFile
- copy, copyWithStats, dataSequenceNumber, fileSequenceNumber, firstRowId, location, manifestLocation
- 
Method Details
- 
wrap
- 
asFile
- 
pos
Description copied from interface: ContentFile
Returns the ordinal position of the file in a manifest, or null if it was not read from a manifest.
- Specified by:
- pos in interface ContentFile<F>
 
- 
specId
public int specId()
Description copied from interface: ContentFile
Returns id of the partition spec used for partition metadata.
- Specified by:
- specId in interface ContentFile<F>
 
- 
content
Description copied from interface: ContentFile
Returns type of content stored in the file; one of DATA, POSITION_DELETES, or EQUALITY_DELETES.
- Specified by:
- content in interface ContentFile<F>
 
- 
path
Description copied from interface: ContentFile
Returns fully qualified path to the file, suitable for constructing a Hadoop Path.
- Specified by:
- path in interface ContentFile<F>
 
- 
format
Description copied from interface: ContentFile
Returns format of the file.
- Specified by:
- format in interface ContentFile<F>
 
- 
partition
Description copied from interface: ContentFile
Returns partition for this file as a StructLike.
- Specified by:
- partition in interface ContentFile<F>
 
- 
recordCount
public long recordCount()
Description copied from interface: ContentFile
Returns the number of top-level records in the file.
- Specified by:
- recordCount in interface ContentFile<F>
 
- 
fileSizeInBytes
public long fileSizeInBytes()
Description copied from interface: ContentFile
Returns the file size in bytes.
- Specified by:
- fileSizeInBytes in interface ContentFile<F>
 
- 
columnSizes
Description copied from interface: ContentFile
Returns if collected, map from column ID to the size of the column in bytes, null otherwise.
- Specified by:
- columnSizes in interface ContentFile<F>
 
- 
valueCounts
Description copied from interface: ContentFile
Returns if collected, map from column ID to the count of its values (including null and NaN values), null otherwise.
- Specified by:
- valueCounts in interface ContentFile<F>
 
- 
nullValueCounts
Description copied from interface: ContentFile
Returns if collected, map from column ID to its null value count, null otherwise.
- Specified by:
- nullValueCounts in interface ContentFile<F>
 
- 
nanValueCounts
Description copied from interface: ContentFile
Returns if collected, map from column ID to its NaN value count, null otherwise.
- Specified by:
- nanValueCounts in interface ContentFile<F>
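Because valueCounts includes null and NaN values, and each of the three count maps may be null when it was not collected, callers typically need null-safe lookups before combining them. The following is a minimal sketch of that arithmetic, not Iceberg code. The class ColumnCounts and its methods are hypothetical, and the sketch assumes the maps are keyed by Integer column ID with Long counts. It also assumes that a column with no NaN entry has no NaN values.

```java
import java.util.Map;

// Hypothetical helper, not part of Iceberg. It assumes the count maps are
// keyed by Integer column ID and hold Long counts.
final class ColumnCounts {
  private ColumnCounts() {}

  // Null-safe lookup: the whole map is null when the stat was not collected.
  static Long lookup(Map<Integer, Long> counts, int columnId) {
    return counts == null ? null : counts.get(columnId);
  }

  // Values that are neither null nor NaN, or null if this cannot be derived.
  static Long nonNullNonNanCount(
      Map<Integer, Long> valueCounts,
      Map<Integer, Long> nullValueCounts,
      Map<Integer, Long> nanValueCounts,
      int columnId) {
    Long total = lookup(valueCounts, columnId);
    Long nulls = lookup(nullValueCounts, columnId);
    if (total == null || nulls == null) {
      return null; // not enough stats collected; do not guess
    }
    // Assumption: a column with no NaN entry contributes zero NaN values.
    Long nans = lookup(nanValueCounts, columnId);
    return total - nulls - (nans == null ? 0L : nans);
  }

  public static void main(String[] args) {
    Map<Integer, Long> values = Map.of(1, 10L);
    Map<Integer, Long> nulls = Map.of(1, 2L);
    Map<Integer, Long> nans = Map.of(1, 3L);
    System.out.println(nonNullNonNanCount(values, nulls, nans, 1)); // prints 5
    System.out.println(nonNullNonNanCount(null, nulls, nans, 1)); // prints null
  }
}
```

Returning null instead of a default keeps "stats not collected" distinguishable from a real count of zero.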
 
- 
lowerBounds
Description copied from interface: ContentFile
Returns if collected, map from column ID to value lower bounds, null otherwise.
- Specified by:
- lowerBounds in interface ContentFile<F>
 
- 
upperBounds
Description copied from interface: ContentFile
Returns if collected, map from column ID to value upper bounds, null otherwise.
- Specified by:
- upperBounds in interface ContentFile<F>
 
- 
keyMetadata
Description copied from interface: ContentFile
Returns metadata about how this file is encrypted, or null if the file is stored in plain text.
- Specified by:
- keyMetadata in interface ContentFile<F>
 
- 
copy
Description copied from interface: ContentFile
Copies this file. Manifest readers can reuse file instances; use this method to copy data when collecting files from tasks.
- Specified by:
- copy in interface ContentFile<F>
- Returns:
- a copy of this data file
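The reason copying matters when instances are reused can be illustrated with a small stand-alone sketch. It does not use Iceberg or Spark classes: ReusableFile is a hypothetical stand-in for a wrapper that is pointed at new underlying data on each wrap call, and CopyDemo.collect is an illustrative helper.

```java
import java.util.ArrayList;
import java.util.List;

final class CopyDemo {
  private CopyDemo() {}

  // Hypothetical stand-in for a reusable file wrapper; not an Iceberg class.
  static final class ReusableFile {
    private String path;

    // Points this same instance at new underlying data and returns it.
    ReusableFile wrap(String newPath) {
      this.path = newPath;
      return this;
    }

    String path() {
      return path;
    }

    // Returns an independent snapshot of the current state.
    ReusableFile copy() {
      ReusableFile snapshot = new ReusableFile();
      snapshot.path = path;
      return snapshot;
    }
  }

  // Collects files while one wrapper instance is reused, then reads their paths.
  static List<String> collect(List<String> paths, boolean copyEach) {
    ReusableFile reused = new ReusableFile();
    List<ReusableFile> collected = new ArrayList<>();
    for (String p : paths) {
      ReusableFile current = reused.wrap(p);
      collected.add(copyEach ? current.copy() : current);
    }
    List<String> result = new ArrayList<>();
    for (ReusableFile f : collected) {
      result.add(f.path());
    }
    return result;
  }

  public static void main(String[] args) {
    List<String> paths = List.of("a.parquet", "b.parquet");
    System.out.println(collect(paths, false)); // prints [b.parquet, b.parquet]
    System.out.println(collect(paths, true)); // prints [a.parquet, b.parquet]
  }
}
```

Without the copy, every collected element is the same object and reflects only the last wrapped value.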
 
- 
copyWithoutStats
Description copied from interface: ContentFile
Copies this file without file stats. Manifest readers can reuse file instances; use this method to copy data without stats when collecting files.
- Specified by:
- copyWithoutStats in interface ContentFile<F>
- Returns:
- a copy of this data file, without lower bounds, upper bounds, value counts, null value counts, or nan value counts
 
- 
splitOffsets
Description copied from interface: ContentFile
Returns list of recommended split locations, if applicable, null otherwise.
When available, this information is used for planning scan tasks whose boundaries are determined by these offsets. The returned list must be sorted in ascending order.
- Specified by:
- splitOffsets in interface ContentFile<F>
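As an illustration of how ascending offsets can determine task boundaries, the sketch below turns a list of offsets and a file size into half-open byte ranges. This is not Iceberg's planning code. The class SplitRanges is hypothetical, and the sketch assumes the offsets are a List of Long values that are strictly increasing, that the first range starts at the first offset, that the last range ends at the file size, and that a null or empty list means the file is treated as a single range.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Iceberg.
final class SplitRanges {
  private SplitRanges() {}

  // Returns [start, end) byte ranges derived from the split offsets.
  static List<long[]> ranges(List<Long> splitOffsets, long fileSizeInBytes) {
    List<long[]> result = new ArrayList<>();
    if (splitOffsets == null || splitOffsets.isEmpty()) {
      // No recommended locations: fall back to one range over the whole file.
      result.add(new long[] {0L, fileSizeInBytes});
      return result;
    }
    for (int i = 0; i < splitOffsets.size(); i++) {
      long start = splitOffsets.get(i);
      long end = i + 1 < splitOffsets.size() ? splitOffsets.get(i + 1) : fileSizeInBytes;
      if (start < 0 || end <= start) {
        throw new IllegalArgumentException(
            "Offsets must be ascending and inside the file: " + splitOffsets);
      }
      result.add(new long[] {start, end});
    }
    return result;
  }

  public static void main(String[] args) {
    for (long[] range : ranges(List.of(4L, 100L, 200L), 300L)) {
      System.out.println(Arrays.toString(range));
    }
  }
}
```

Rejecting unsorted input up front mirrors the requirement that the returned list be sorted in ascending order.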
 
- 
sortOrderId
Description copied from interface: ContentFile
Returns the sort order id of this file, which describes how the file is ordered. This information will be useful for merging data and equality delete files more efficiently when they share the same sort order id.
- Specified by:
- sortOrderId in interface ContentFile<F>
 
- 
equalityFieldIds
Description copied from interface: ContentFile
Returns the set of field IDs used for equality comparison, in equality delete files.
An equality delete file may contain additional data fields that are not used by equality comparison. The subset of columns in a delete file to be used in equality comparison is tracked by ID. Extra columns can be used to reconstruct changes, and metrics from extra columns are used during job planning.
- Specified by:
- equalityFieldIds in interface ContentFile<F>
- Returns:
- IDs of the fields used in equality comparison with the records in this delete file
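The idea that only a subset of a delete file's columns takes part in the comparison can be sketched as follows. This is an illustration, not Iceberg's delete-application code. The class EqualityDeleteMatch is hypothetical, rows are modeled as maps from field ID to value, the field IDs are assumed to arrive as a List of Integer, and two missing (null) values are assumed to compare as equal.

```java
import java.util.List;
import java.util.Map;
import java.util.Objects;

// Hypothetical helper, not part of Iceberg. Rows are modeled as maps from
// field ID to value for the sake of the example.
final class EqualityDeleteMatch {
  private EqualityDeleteMatch() {}

  // True when the data row equals the delete row on every equality field ID.
  static boolean matches(
      Map<Integer, Object> deleteRow,
      Map<Integer, Object> dataRow,
      List<Integer> equalityFieldIds) {
    if (equalityFieldIds.isEmpty()) {
      throw new IllegalArgumentException("At least one equality field ID is required");
    }
    for (Integer fieldId : equalityFieldIds) {
      // Assumption: two missing (null) values are treated as equal.
      if (!Objects.equals(deleteRow.get(fieldId), dataRow.get(fieldId))) {
        return false;
      }
    }
    return true; // fields outside equalityFieldIds are ignored
  }

  public static void main(String[] args) {
    Map<Integer, Object> deleteRow = Map.of(1, 42, 3, "note");
    Map<Integer, Object> dataRow = Map.of(1, 42, 2, "x", 3, "other");
    System.out.println(matches(deleteRow, dataRow, List.of(1))); // prints true
    System.out.println(matches(deleteRow, dataRow, List.of(1, 3))); // prints false
  }
}
```

In the first call the extra field 3 differs between the two rows but does not affect the result, because it is not among the equality field IDs.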
 
- 
referencedDataFile
- 
contentOffset
- 
contentSizeInBytes
 
-