Package org.apache.iceberg.parquet
Class ParquetUtil
- java.lang.Object
-
- org.apache.iceberg.parquet.ParquetUtil
-
public class ParquetUtil extends java.lang.Object
-
-
Method Summary
All Methods Static Methods Concrete Methods Modifier and Type Method Description static longextractTimestampInt96(java.nio.ByteBuffer buffer)Method to read timestamp (parquet Int96) from bytebuffer.static MetricsfileMetrics(InputFile file, MetricsConfig metricsConfig)static MetricsfileMetrics(InputFile file, MetricsConfig metricsConfig, NameMapping nameMapping)static MetricsfooterMetrics(org.apache.parquet.hadoop.metadata.ParquetMetadata metadata, java.util.stream.Stream<FieldMetrics<?>> fieldMetrics, MetricsConfig metricsConfig)static MetricsfooterMetrics(org.apache.parquet.hadoop.metadata.ParquetMetadata metadata, java.util.stream.Stream<FieldMetrics<?>> fieldMetrics, MetricsConfig metricsConfig, NameMapping nameMapping)static java.util.List<java.lang.Long>getSplitOffsets(org.apache.parquet.hadoop.metadata.ParquetMetadata md)Returns a list of offsets in ascending order determined by the starting position of the row groups.static booleanhasNoBloomFilterPages(org.apache.parquet.hadoop.metadata.ColumnChunkMetaData meta)static booleanhasNonDictionaryPages(org.apache.parquet.hadoop.metadata.ColumnChunkMetaData meta)static booleanisIntType(org.apache.parquet.schema.PrimitiveType primitiveType)static org.apache.parquet.column.DictionaryreadDictionary(org.apache.parquet.column.ColumnDescriptor desc, org.apache.parquet.column.page.PageReader pageSource)
-
-
-
Method Detail
-
fileMetrics
public static Metrics fileMetrics(InputFile file, MetricsConfig metricsConfig)
-
fileMetrics
public static Metrics fileMetrics(InputFile file, MetricsConfig metricsConfig, NameMapping nameMapping)
-
footerMetrics
public static Metrics footerMetrics(org.apache.parquet.hadoop.metadata.ParquetMetadata metadata, java.util.stream.Stream<FieldMetrics<?>> fieldMetrics, MetricsConfig metricsConfig)
-
footerMetrics
public static Metrics footerMetrics(org.apache.parquet.hadoop.metadata.ParquetMetadata metadata, java.util.stream.Stream<FieldMetrics<?>> fieldMetrics, MetricsConfig metricsConfig, NameMapping nameMapping)
-
getSplitOffsets
public static java.util.List<java.lang.Long> getSplitOffsets(org.apache.parquet.hadoop.metadata.ParquetMetadata md)
Returns a list of offsets in ascending order determined by the starting position of the row groups.
-
hasNonDictionaryPages
public static boolean hasNonDictionaryPages(org.apache.parquet.hadoop.metadata.ColumnChunkMetaData meta)
-
hasNoBloomFilterPages
public static boolean hasNoBloomFilterPages(org.apache.parquet.hadoop.metadata.ColumnChunkMetaData meta)
-
readDictionary
public static org.apache.parquet.column.Dictionary readDictionary(org.apache.parquet.column.ColumnDescriptor desc, org.apache.parquet.column.page.PageReader pageSource)
-
isIntType
public static boolean isIntType(org.apache.parquet.schema.PrimitiveType primitiveType)
-
extractTimestampInt96
public static long extractTimestampInt96(java.nio.ByteBuffer buffer)
Method to read timestamp (parquet Int96) from bytebuffer. Read 12 bytes in byteBuffer: 8 bytes (time of day nanos) + 4 bytes(julianDay)
-
-