Package org.apache.iceberg.parquet
Class ParquetUtil
- java.lang.Object
-
- org.apache.iceberg.parquet.ParquetUtil
-
public class ParquetUtil extends java.lang.Object
-
-
Method Summary
All Methods Static Methods Concrete Methods Modifier and Type Method Description static long
extractTimestampInt96(java.nio.ByteBuffer buffer)
Method to read timestamp (parquet Int96) from bytebuffer.static Metrics
fileMetrics(InputFile file, MetricsConfig metricsConfig)
static Metrics
fileMetrics(InputFile file, MetricsConfig metricsConfig, NameMapping nameMapping)
static Metrics
footerMetrics(org.apache.parquet.hadoop.metadata.ParquetMetadata metadata, java.util.stream.Stream<FieldMetrics<?>> fieldMetrics, MetricsConfig metricsConfig)
static Metrics
footerMetrics(org.apache.parquet.hadoop.metadata.ParquetMetadata metadata, java.util.stream.Stream<FieldMetrics<?>> fieldMetrics, MetricsConfig metricsConfig, NameMapping nameMapping)
static java.util.List<java.lang.Long>
getSplitOffsets(org.apache.parquet.hadoop.metadata.ParquetMetadata md)
Returns a list of offsets in ascending order determined by the starting position of the row groups.static boolean
hasNoBloomFilterPages(org.apache.parquet.hadoop.metadata.ColumnChunkMetaData meta)
static boolean
hasNonDictionaryPages(org.apache.parquet.hadoop.metadata.ColumnChunkMetaData meta)
static boolean
isIntType(org.apache.parquet.schema.PrimitiveType primitiveType)
static org.apache.parquet.column.Dictionary
readDictionary(org.apache.parquet.column.ColumnDescriptor desc, org.apache.parquet.column.page.PageReader pageSource)
-
-
-
Method Detail
-
fileMetrics
public static Metrics fileMetrics(InputFile file, MetricsConfig metricsConfig)
-
fileMetrics
public static Metrics fileMetrics(InputFile file, MetricsConfig metricsConfig, NameMapping nameMapping)
-
footerMetrics
public static Metrics footerMetrics(org.apache.parquet.hadoop.metadata.ParquetMetadata metadata, java.util.stream.Stream<FieldMetrics<?>> fieldMetrics, MetricsConfig metricsConfig)
-
footerMetrics
public static Metrics footerMetrics(org.apache.parquet.hadoop.metadata.ParquetMetadata metadata, java.util.stream.Stream<FieldMetrics<?>> fieldMetrics, MetricsConfig metricsConfig, NameMapping nameMapping)
-
getSplitOffsets
public static java.util.List<java.lang.Long> getSplitOffsets(org.apache.parquet.hadoop.metadata.ParquetMetadata md)
Returns a list of offsets in ascending order determined by the starting position of the row groups.
-
hasNonDictionaryPages
public static boolean hasNonDictionaryPages(org.apache.parquet.hadoop.metadata.ColumnChunkMetaData meta)
-
hasNoBloomFilterPages
public static boolean hasNoBloomFilterPages(org.apache.parquet.hadoop.metadata.ColumnChunkMetaData meta)
-
readDictionary
public static org.apache.parquet.column.Dictionary readDictionary(org.apache.parquet.column.ColumnDescriptor desc, org.apache.parquet.column.page.PageReader pageSource)
-
isIntType
public static boolean isIntType(org.apache.parquet.schema.PrimitiveType primitiveType)
-
extractTimestampInt96
public static long extractTimestampInt96(java.nio.ByteBuffer buffer)
Method to read timestamp (parquet Int96) from bytebuffer. Read 12 bytes in byteBuffer: 8 bytes (time of day nanos) + 4 bytes(julianDay)
-
-