F
- the concrete Java class of a ContentFile instance.public interface ContentFile<F>
DataFile
and DeleteFile
that exposes common methods.Modifier and Type | Method and Description |
---|---|
java.util.Map<java.lang.Integer,java.lang.Long> |
columnSizes()
Returns if collected, map from column ID to the size of the column in bytes, null otherwise.
|
FileContent |
content()
Returns type of content stored in the file; one of DATA, POSITION_DELETES, or EQUALITY_DELETES.
|
F |
copy()
Copies this file.
|
F |
copyWithoutStats()
Copies this file without file stats.
|
java.util.List<java.lang.Integer> |
equalityFieldIds()
Returns the set of field IDs used for equality comparison, in equality delete files.
|
long |
fileSizeInBytes()
Returns the file size in bytes.
|
FileFormat |
format()
Returns format of the file.
|
java.nio.ByteBuffer |
keyMetadata()
Returns metadata about how this file is encrypted, or null if the file is stored in plain text.
|
java.util.Map<java.lang.Integer,java.nio.ByteBuffer> |
lowerBounds()
Returns if collected, map from column ID to value lower bounds, null otherwise.
|
java.util.Map<java.lang.Integer,java.lang.Long> |
nanValueCounts()
Returns if collected, map from column ID to its NaN value count, null otherwise.
|
java.util.Map<java.lang.Integer,java.lang.Long> |
nullValueCounts()
Returns if collected, map from column ID to its null value count, null otherwise.
|
StructLike |
partition()
Returns partition for this file as a
StructLike . |
java.lang.CharSequence |
path()
Returns fully qualified path to the file, suitable for constructing a Hadoop Path.
|
java.lang.Long |
pos()
Returns the ordinal position of the file in a manifest, or null if it was not read from a manifest.
|
long |
recordCount()
Returns the number of top-level records in the file.
|
default java.lang.Integer |
sortOrderId()
Returns the sort order id of this file, which describes how the file is ordered.
|
int |
specId()
Returns id of the partition spec used for partition metadata.
|
java.util.List<java.lang.Long> |
splitOffsets()
Returns list of recommended split locations, if applicable, null otherwise.
|
java.util.Map<java.lang.Integer,java.nio.ByteBuffer> |
upperBounds()
Returns if collected, map from column ID to value upper bounds, null otherwise.
|
java.util.Map<java.lang.Integer,java.lang.Long> |
valueCounts()
Returns if collected, map from column ID to the count of its non-null values, null otherwise.
|
java.lang.Long pos()
int specId()
FileContent content()
java.lang.CharSequence path()
FileFormat format()
StructLike partition()
StructLike
.long recordCount()
long fileSizeInBytes()
java.util.Map<java.lang.Integer,java.lang.Long> columnSizes()
java.util.Map<java.lang.Integer,java.lang.Long> valueCounts()
java.util.Map<java.lang.Integer,java.lang.Long> nullValueCounts()
java.util.Map<java.lang.Integer,java.lang.Long> nanValueCounts()
java.util.Map<java.lang.Integer,java.nio.ByteBuffer> lowerBounds()
java.util.Map<java.lang.Integer,java.nio.ByteBuffer> upperBounds()
java.nio.ByteBuffer keyMetadata()
java.util.List<java.lang.Long> splitOffsets()
When available, this information is used for planning scan tasks whose boundaries are determined by these offsets. The returned list must be sorted in ascending order.
java.util.List<java.lang.Integer> equalityFieldIds()
An equality delete file may contain additional data fields that are not used by equality comparison. The subset of columns in a delete file to be used in equality comparison are tracked by ID. Extra columns can be used to reconstruct changes and metrics from extra columns are used during job planning.
default java.lang.Integer sortOrderId()
F copy()
F copyWithoutStats()