Class VectorizedPageIterator

    • Constructor Detail

      • VectorizedPageIterator

        public VectorizedPageIterator(org.apache.parquet.column.ColumnDescriptor desc,
                                      java.lang.String writerVersion,
                                      boolean setValidityVector)
    • Method Detail

      • setAllPagesDictEncoded

        public void setAllPagesDictEncoded(boolean allDictEncoded)
      • initDataReader

        protected void initDataReader(org.apache.parquet.column.Encoding dataEncoding,
                                      org.apache.parquet.bytes.ByteBufferInputStream in,
                                      int valueCount)
        Specified by:
        initDataReader in class BasePageIterator
      • producesDictionaryEncodedVector

        public boolean producesDictionaryEncodedVector()
      • initDefinitionLevelsReader

        protected void initDefinitionLevelsReader(org.apache.parquet.column.page.DataPageV1 dataPageV1,
                                                  org.apache.parquet.column.ColumnDescriptor desc,
                                                  org.apache.parquet.bytes.ByteBufferInputStream in,
                                                  int triplesCount)
                                           throws java.io.IOException
        Specified by:
        initDefinitionLevelsReader in class BasePageIterator
        Throws:
        java.io.IOException
      • initDefinitionLevelsReader

        protected void initDefinitionLevelsReader(org.apache.parquet.column.page.DataPageV2 dataPageV2,
                                                  org.apache.parquet.column.ColumnDescriptor desc)
                                           throws java.io.IOException
        Specified by:
        initDefinitionLevelsReader in class BasePageIterator
        Throws:
        java.io.IOException
      • nextBatchDictionaryIds

        public int nextBatchDictionaryIds(org.apache.arrow.vector.IntVector vector,
                                          int expectedBatchSize,
                                          int numValsInVector,
                                          NullabilityHolder holder)
        Reads a batch of dictionary ids from dictionary-encoded data pages. Like definition levels, dictionary ids in Parquet are stored using the RLE/bit-packed hybrid encoding.
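
        The RLE/bit-packed hybrid layout mentioned above can be illustrated with a minimal, standalone sketch. This is not the reader this class uses internally; the class name RleBitPackedSketch and the helpers decode and readUnsignedVarInt are hypothetical, and the sketch assumes the bit width is already known (at most 32) and that the buffer holds only runs, with no length prefix.

```java
import java.nio.ByteBuffer;
import java.util.Arrays;

// Hypothetical illustration of Parquet's RLE/bit-packed hybrid encoding.
// Not Iceberg's implementation; names and structure are assumptions.
final class RleBitPackedSketch {

    // Decodes up to `count` values of `bitWidth` bits each (assumes bitWidth <= 32).
    static int[] decode(ByteBuffer in, int bitWidth, int count) {
        int[] out = new int[count];
        int pos = 0;
        long mask = (1L << bitWidth) - 1;
        while (pos < count && in.hasRemaining()) {
            int header = readUnsignedVarInt(in);
            if ((header & 1) == 0) {
                // RLE run: header >>> 1 repetitions of one value,
                // stored little-endian in ceil(bitWidth / 8) bytes.
                int runLength = header >>> 1;
                int bytesPerValue = (bitWidth + 7) / 8;
                int value = 0;
                for (int i = 0; i < bytesPerValue; i++) {
                    value |= (in.get() & 0xFF) << (8 * i);
                }
                int n = Math.min(runLength, count - pos);
                Arrays.fill(out, pos, pos + n, value);
                pos += n;
            } else {
                // Bit-packed run: header >>> 1 groups of 8 values,
                // packed from the least significant bit of each byte.
                int numValues = (header >>> 1) * 8;
                long bitBuffer = 0;
                int bitsAvailable = 0;
                for (int i = 0; i < numValues; i++) {
                    while (bitsAvailable < bitWidth) {
                        bitBuffer |= (long) (in.get() & 0xFF) << bitsAvailable;
                        bitsAvailable += 8;
                    }
                    int v = (int) (bitBuffer & mask);
                    bitBuffer >>>= bitWidth;
                    bitsAvailable -= bitWidth;
                    if (pos < count) {
                        out[pos++] = v; // values past `count` are padding
                    }
                }
            }
        }
        return out;
    }

    // Reads a ULEB128 varint, as used for run headers.
    static int readUnsignedVarInt(ByteBuffer in) {
        int result = 0;
        int shift = 0;
        while (true) {
            int b = in.get() & 0xFF;
            result |= (b & 0x7F) << shift;
            if ((b & 0x80) == 0) {
                return result;
            }
            shift += 7;
        }
    }

    public static void main(String[] args) {
        // RLE run (5 x value 4), then one bit-packed group holding 0..7, bit width 3.
        byte[] encoded = {0x0A, 0x04, 0x03, (byte) 0x88, (byte) 0xC6, (byte) 0xFA};
        int[] ids = decode(ByteBuffer.wrap(encoded), 3, 13);
        System.out.println(Arrays.toString(ids));
        // prints [4, 4, 4, 4, 4, 0, 1, 2, 3, 4, 5, 6, 7]
    }
}
```

        In the sample input, the header byte 0x0A has a clear low bit and so marks an RLE run of length 5, while 0x03 has the low bit set and marks a single bit-packed group of eight 3-bit values.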