Class VectorizedArrowReader

    • Constructor Summary

      Constructors 
      Constructor Description
      VectorizedArrowReader​(org.apache.parquet.column.ColumnDescriptor desc, Types.NestedField icebergField, org.apache.arrow.memory.BufferAllocator ra, boolean setArrowValidityVector)  
    • Method Summary

      All Methods Static Methods Instance Methods Concrete Methods 
      Modifier and Type Method Description
      void close()
      Release any resources allocated.
      static VectorizedArrowReader nulls()  
      VectorHolder read​(VectorHolder reuse, int numValsToRead)
      Reads a batch of type @param <T> and of size numRows
      void setBatchSize​(int batchSize)  
      void setRowGroupInfo​(org.apache.parquet.column.page.PageReadStore source, java.util.Map<org.apache.parquet.hadoop.metadata.ColumnPath,​org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metadata)  
      java.lang.String toString()  
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
    • Constructor Detail

      • VectorizedArrowReader

        public VectorizedArrowReader​(org.apache.parquet.column.ColumnDescriptor desc,
                                     Types.NestedField icebergField,
                                     org.apache.arrow.memory.BufferAllocator ra,
                                     boolean setArrowValidityVector)
    • Method Detail

      • read

        public VectorHolder read​(VectorHolder reuse,
                                 int numValsToRead)
        Description copied from interface: VectorizedReader
        Reads a batch of type @param <T> and of size numRows
        Specified by:
        read in interface VectorizedReader<VectorHolder>
        Parameters:
        reuse - container for the last batch to be reused for next batch
        numValsToRead - number of rows to read
        Returns:
        batch of records of type @param <T>
      • setRowGroupInfo

        public void setRowGroupInfo​(org.apache.parquet.column.page.PageReadStore source,
                                    java.util.Map<org.apache.parquet.hadoop.metadata.ColumnPath,​org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metadata)
        Specified by:
        setRowGroupInfo in interface VectorizedReader<VectorHolder>
        Parameters:
        source - row group information for all the columns
        metadata - map of ColumnPath -> ColumnChunkMetaData for the row group
      • toString

        public java.lang.String toString()
        Overrides:
        toString in class java.lang.Object