Class VectorizedArrowReader

    • Constructor Detail

      • VectorizedArrowReader

        public VectorizedArrowReader(org.apache.parquet.column.ColumnDescriptor desc,
                                     Types.NestedField icebergField,
                                     org.apache.arrow.memory.BufferAllocator ra,
                                     boolean setArrowValidityVector)
    • Method Detail

      • read

        public VectorHolder read(VectorHolder reuse,
                                 int numValsToRead)
        Description copied from interface: VectorizedReader
        Reads a batch of type T (here, VectorHolder) of size numValsToRead.
        Specified by:
        read in interface VectorizedReader<VectorHolder>
        Parameters:
        reuse - container for the last batch to be reused for next batch
        numValsToRead - number of rows to read
        Returns:
        batch of records of type T (here, VectorHolder)
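
        The reuse parameter suggests a pattern in which the caller hands the previous batch back to the reader so that its container can be filled again. The sketch below illustrates that pattern only; Holder and CountingReader are hypothetical stand-ins written for this example, not Iceberg classes, and the reuse condition (capacity check) is an assumption rather than the library's actual behavior.

```java
import java.util.Arrays;

public class BatchReuseSketch {
    // Hypothetical stand-in for a batch container: a fixed-capacity buffer plus a fill count.
    static final class Holder {
        final int[] values;
        int size;

        Holder(int capacity) {
            this.values = new int[capacity];
        }
    }

    // Hypothetical stand-in for a reader exposing read(reuse, numValsToRead).
    static final class CountingReader {
        private int next = 0;

        Holder read(Holder reuse, int numValsToRead) {
            // Assumption: reuse the previous container when it is large enough, otherwise allocate.
            Holder out = (reuse != null && reuse.values.length >= numValsToRead)
                ? reuse
                : new Holder(numValsToRead);
            for (int i = 0; i < numValsToRead; i++) {
                out.values[i] = next++;
            }
            out.size = numValsToRead;
            return out;
        }
    }

    public static void main(String[] args) {
        CountingReader reader = new CountingReader();
        Holder batch = null;
        for (int b = 0; b < 3; b++) {
            // Pass the last batch back in so its buffer can be refilled.
            batch = reader.read(batch, 4);
            System.out.println(Arrays.toString(Arrays.copyOf(batch.values, batch.size)));
        }
    }
}
```

        In this sketch the values of the previous batch are overwritten on each call, so a caller that needs them later would have to copy them out first.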
      • setRowGroupInfo

        public void setRowGroupInfo(org.apache.parquet.column.page.PageReadStore source,
                                    java.util.Map<org.apache.parquet.hadoop.metadata.ColumnPath,org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metadata,
                                    long rowPosition)
        Description copied from interface: VectorizedReader
        Sets the row group information to be used with this reader
        Specified by:
        setRowGroupInfo in interface VectorizedReader<VectorHolder>
        Parameters:
        source - row group information for all the columns
        metadata - map of ColumnPath -> ColumnChunkMetaData for the row group
        rowPosition - the row group's row offset in the Parquet file
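
        As an illustration of the rowPosition parameter, the sketch below derives a row offset for each row group from per-group row counts. It assumes the offset is the zero-based number of rows that precede the row group in the file; RowPositionSketch and rowOffsets are hypothetical names introduced for this example and are not part of Iceberg or Parquet.

```java
import java.util.Arrays;

public class RowPositionSketch {
    // Returns, for each row group, the number of rows that precede it in the file.
    static long[] rowOffsets(long[] rowCounts) {
        long[] offsets = new long[rowCounts.length];
        long position = 0;
        for (int i = 0; i < rowCounts.length; i++) {
            offsets[i] = position;     // offset of this row group's first row
            position += rowCounts[i];  // advance past this row group's rows
        }
        return offsets;
    }

    public static void main(String[] args) {
        // Three row groups holding 100, 250 and 75 rows.
        long[] offsets = rowOffsets(new long[] {100, 250, 75});
        System.out.println(Arrays.toString(offsets)); // prints [0, 100, 350]
    }
}
```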
      • toString

        public java.lang.String toString()
        Overrides:
        toString in class java.lang.Object
      • positionsWithSetArrowValidityVector

        public static VectorizedArrowReader positionsWithSetArrowValidityVector()