Class VectorizedDeltaByteArrayValuesReader

java.lang.Object
org.apache.parquet.column.values.ValuesReader
org.apache.iceberg.arrow.vectorized.parquet.VectorizedDeltaByteArrayValuesReader

public class VectorizedDeltaByteArrayValuesReader extends org.apache.parquet.column.values.ValuesReader
A VectorizedValuesReader implementation for the DELTA_BYTE_ARRAY encoding. This encoding stores delta-encoded prefix lengths, followed by the suffixes encoded as DELTA_LENGTH_BYTE_ARRAY. Each value is reconstructed by taking the first prefix-length bytes of the previous value and appending the current suffix. Adapted from Spark's VectorizedDeltaByteArrayReader.
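The reconstruction step described above can be illustrated with a minimal sketch. It is not this class's implementation: it assumes the prefix lengths and suffixes have already been decoded into plain arrays, and the names `DeltaByteArraySketch`, `reconstruct`, and `reconstructStrings` are hypothetical.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Illustrative only: a hypothetical helper, not part of Iceberg or Parquet.
final class DeltaByteArraySketch {

  // Rebuilds each value as (first prefixLengths[i] bytes of the previous value) + suffixes[i].
  // Assumes both inputs were already decoded from the page.
  static List<byte[]> reconstruct(int[] prefixLengths, byte[][] suffixes) {
    List<byte[]> values = new ArrayList<>(prefixLengths.length);
    byte[] previous = new byte[0]; // the first value has no predecessor
    for (int i = 0; i < prefixLengths.length; i++) {
      int prefixLength = prefixLengths[i];
      byte[] suffix = suffixes[i];
      byte[] value = new byte[prefixLength + suffix.length];
      System.arraycopy(previous, 0, value, 0, prefixLength);
      System.arraycopy(suffix, 0, value, prefixLength, suffix.length);
      values.add(value);
      previous = value;
    }
    return values;
  }

  // Convenience wrapper for UTF-8 strings; prefix lengths are counted in bytes.
  static List<String> reconstructStrings(int[] prefixLengths, String[] suffixes) {
    byte[][] raw = new byte[suffixes.length][];
    for (int i = 0; i < suffixes.length; i++) {
      raw[i] = suffixes[i].getBytes(StandardCharsets.UTF_8);
    }
    List<String> out = new ArrayList<>();
    for (byte[] value : reconstruct(prefixLengths, raw)) {
      out.add(new String(value, StandardCharsets.UTF_8));
    }
    return out;
  }

  public static void main(String[] args) {
    int[] prefixLengths = {0, 5, 2};
    String[] suffixes = {"apple", "sauce", "ricot"};
    // prints [apple, applesauce, apricot]
    System.out.println(reconstructStrings(prefixLengths, suffixes));
  }
}
```

Because each value depends on the one before it, values in this sketch can only be rebuilt in order, starting from the first.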
  • Field Summary

    Fields
    Modifier and Type
    Field
    Description
    static final int
     
    static final int
     
    static final int
     
    static final int
     
  • Constructor Summary

    Constructors
    Constructor
    Description
    VectorizedDeltaByteArrayValuesReader()
     
  • Method Summary

    Modifier and Type
    Method
    Description
    void
    initFromPage(int valueCount, org.apache.parquet.bytes.ByteBufferInputStream in)
     
    org.apache.parquet.io.api.Binary
    readBinary(int len)
     
    default boolean
    readBoolean()
    Read a single boolean
    default byte
    readByte()
    Read a single byte
    default double
    readDouble()
    Read a single double
    default void
    readDoubles(int total, org.apache.arrow.vector.FieldVector vec, int rowId)
    Read `total` doubles into `vec` starting at `vec[rowId]`
    default float
    readFloat()
    Read a single float
    default void
    readFloats(int total, org.apache.arrow.vector.FieldVector vec, int rowId)
    Read `total` floats into `vec` starting at `vec[rowId]`
    int
    readInteger()
     
    default void
    readIntegers(int total, org.apache.arrow.vector.FieldVector vec, int rowId)
    Read `total` integers into `vec` starting at `vec[rowId]`
    default long
    readLong()
    Read a single long
    default void
    readLongs(int total, org.apache.arrow.vector.FieldVector vec, int rowId)
    Read `total` longs into `vec` starting at `vec[rowId]`
    default short
    readShort()
    Read a single short
    void
    skip()
     

    Methods inherited from class org.apache.parquet.column.values.ValuesReader

    getNextOffset, initFromPage, initFromPage, readBoolean, readBytes, readDouble, readFloat, readLong, readValueDictionaryId, skip, updateNextOffset

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
  • Field Details

  • Constructor Details

    • VectorizedDeltaByteArrayValuesReader

      public VectorizedDeltaByteArrayValuesReader()
  • Method Details

    • initFromPage

      public void initFromPage(int valueCount, org.apache.parquet.bytes.ByteBufferInputStream in) throws IOException
      Overrides:
      initFromPage in class org.apache.parquet.column.values.ValuesReader
      Throws:
      IOException
    • readInteger

      public int readInteger()
      Overrides:
      readInteger in class org.apache.parquet.column.values.ValuesReader
    • readBinary

      public org.apache.parquet.io.api.Binary readBinary(int len)
    • skip

      public void skip()
      Specified by:
      skip in class org.apache.parquet.column.values.ValuesReader
    • readBoolean

      default boolean readBoolean()
      Read a single boolean
    • readByte

      default byte readByte()
      Read a single byte
    • readShort

      default short readShort()
      Read a single short
    • readLong

      default long readLong()
      Read a single long
    • readFloat

      default float readFloat()
      Read a single float
    • readDouble

      default double readDouble()
      Read a single double
    • readIntegers

      default void readIntegers(int total, org.apache.arrow.vector.FieldVector vec, int rowId)
      Read `total` integers into `vec` starting at `vec[rowId]`
    • readLongs

      default void readLongs(int total, org.apache.arrow.vector.FieldVector vec, int rowId)
      Read `total` longs into `vec` starting at `vec[rowId]`
    • readFloats

      default void readFloats(int total, org.apache.arrow.vector.FieldVector vec, int rowId)
      Read `total` floats into `vec` starting at `vec[rowId]`
    • readDoubles

      default void readDoubles(int total, org.apache.arrow.vector.FieldVector vec, int rowId)
      Read `total` doubles into `vec` starting at `vec[rowId]`
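The batch methods above share one contract: read `total` values and write them into consecutive slots beginning at index `rowId`. The sketch below illustrates only that indexing. It uses a plain `int[]` as a stand-in for an Arrow `FieldVector` and a `java.util.function.IntSupplier` as a stand-in for the value source; `BatchReadSketch` and its `readIntegers` are hypothetical and are not the Iceberg methods.

```java
import java.util.Arrays;
import java.util.function.IntSupplier;

// Illustrative only: plain arrays stand in for Arrow vectors.
final class BatchReadSketch {

  // Writes `total` values from `source` into vec[rowId] .. vec[rowId + total - 1].
  static void readIntegers(int total, int[] vec, int rowId, IntSupplier source) {
    for (int i = 0; i < total; i++) {
      vec[rowId + i] = source.getAsInt();
    }
  }

  public static void main(String[] args) {
    int[] vec = new int[5];
    int[] next = {10}; // mutable counter: yields 10, 11, 12, ...
    readIntegers(3, vec, 1, () -> next[0]++);
    // prints [0, 10, 11, 12, 0]
    System.out.println(Arrays.toString(vec));
  }
}
```

Slots outside the range `rowId` to `rowId + total - 1` are left untouched in this sketch, which lets a caller fill one vector across several calls.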