Class BaseVectorizedParquetValuesReader
java.lang.Object
org.apache.parquet.column.values.ValuesReader
org.apache.iceberg.arrow.vectorized.parquet.BaseVectorizedParquetValuesReader
- Direct Known Subclasses:
VectorizedDictionaryEncodedParquetValuesReader
,VectorizedParquetDefinitionLevelReader
public class BaseVectorizedParquetValuesReader
extends org.apache.parquet.column.values.ValuesReader
A values reader for Parquet's run-length encoded data that reads column data in batches instead
of one value at a time. This is based off of the VectorizedRleValuesReader class in Apache Spark
with these changes:
Writes batches of values retrieved to Arrow vectors. If all pages of a column within the row group are not dictionary encoded, then dictionary ids are eagerly decoded into actual values before writing them to the Arrow vectors
-
Constructor Summary
ConstructorDescriptionBaseVectorizedParquetValuesReader
(int maxDefLevel, boolean setValidityVector) BaseVectorizedParquetValuesReader
(int bitWidth, int maxDefLevel, boolean setValidityVector) BaseVectorizedParquetValuesReader
(int bitWidth, int maxDefLevel, boolean readLength, boolean setValidityVector) -
Method Summary
Modifier and TypeMethodDescriptionvoid
initFromPage
(int valueCount, org.apache.parquet.bytes.ByteBufferInputStream in) boolean
int
int
void
skip()
Methods inherited from class org.apache.parquet.column.values.ValuesReader
getNextOffset, initFromPage, initFromPage, readBytes, readDouble, readFloat, readLong, skip, updateNextOffset
-
Constructor Details
-
BaseVectorizedParquetValuesReader
public BaseVectorizedParquetValuesReader(int maxDefLevel, boolean setValidityVector) -
BaseVectorizedParquetValuesReader
public BaseVectorizedParquetValuesReader(int bitWidth, int maxDefLevel, boolean setValidityVector) -
BaseVectorizedParquetValuesReader
public BaseVectorizedParquetValuesReader(int bitWidth, int maxDefLevel, boolean readLength, boolean setValidityVector)
-
-
Method Details
-
initFromPage
public void initFromPage(int valueCount, org.apache.parquet.bytes.ByteBufferInputStream in) throws IOException - Overrides:
initFromPage
in classorg.apache.parquet.column.values.ValuesReader
- Throws:
IOException
-
readBoolean
public boolean readBoolean()- Overrides:
readBoolean
in classorg.apache.parquet.column.values.ValuesReader
-
skip
public void skip()- Specified by:
skip
in classorg.apache.parquet.column.values.ValuesReader
-
readValueDictionaryId
public int readValueDictionaryId()- Overrides:
readValueDictionaryId
in classorg.apache.parquet.column.values.ValuesReader
-
readInteger
public int readInteger()- Overrides:
readInteger
in classorg.apache.parquet.column.values.ValuesReader
-