Class ParquetUtil

java.lang.Object
org.apache.iceberg.parquet.ParquetUtil

public class ParquetUtil extends Object
  • Method Details

    • fileMetrics

      public static Metrics fileMetrics(InputFile file, MetricsConfig metricsConfig)
    • fileMetrics

      public static Metrics fileMetrics(InputFile file, MetricsConfig metricsConfig, NameMapping nameMapping)
    • footerMetrics

      public static Metrics footerMetrics(org.apache.parquet.hadoop.metadata.ParquetMetadata metadata, Stream<FieldMetrics<?>> fieldMetrics, MetricsConfig metricsConfig)
    • footerMetrics

      public static Metrics footerMetrics(org.apache.parquet.hadoop.metadata.ParquetMetadata metadata, Stream<FieldMetrics<?>> fieldMetrics, MetricsConfig metricsConfig, NameMapping nameMapping)
    • getSplitOffsets

      public static List<Long> getSplitOffsets(org.apache.parquet.hadoop.metadata.ParquetMetadata md)
      Returns a list of offsets in ascending order determined by the starting position of the row groups.
    • hasNonDictionaryPages

      public static boolean hasNonDictionaryPages(org.apache.parquet.hadoop.metadata.ColumnChunkMetaData meta)
    • hasNoBloomFilterPages

      public static boolean hasNoBloomFilterPages(org.apache.parquet.hadoop.metadata.ColumnChunkMetaData meta)
    • readDictionary

      public static org.apache.parquet.column.Dictionary readDictionary(org.apache.parquet.column.ColumnDescriptor desc, org.apache.parquet.column.page.PageReader pageSource)
    • isIntType

      public static boolean isIntType(org.apache.parquet.schema.PrimitiveType primitiveType)
    • extractTimestampInt96

      public static long extractTimestampInt96(ByteBuffer buffer)
      Method to read timestamp (parquet Int96) from bytebuffer. Read 12 bytes in byteBuffer: 8 bytes (time of day nanos) + 4 bytes(julianDay)