Package org.apache.iceberg.mr.mapreduce
Class IcebergInputFormat<T>
java.lang.Object
  org.apache.hadoop.mapreduce.InputFormat<java.lang.Void,T>
    org.apache.iceberg.mr.mapreduce.IcebergInputFormat<T>
Type Parameters:
T - the in-memory data model, which can be Pig tuples, Hive rows, or Iceberg records (the default)
public class IcebergInputFormat<T> extends org.apache.hadoop.mapreduce.InputFormat<java.lang.Void,T>
Generic MRv2 InputFormat API for Iceberg.
Constructor Summary
Constructor: IcebergInputFormat()
Method Summary
Modifier and Type: static InputFormatConfig.ConfigBuilder
Method: configure(org.apache.hadoop.mapreduce.Job job)
Description: Configures the Job to use the IcebergInputFormat and returns a helper to add further configuration.

Modifier and Type: org.apache.hadoop.mapreduce.RecordReader<java.lang.Void,T>
Method: createRecordReader(org.apache.hadoop.mapreduce.InputSplit split, org.apache.hadoop.mapreduce.TaskAttemptContext context)

Modifier and Type: java.util.List<org.apache.hadoop.mapreduce.InputSplit>
Method: getSplits(org.apache.hadoop.mapreduce.JobContext context)
Method Detail
configure
public static InputFormatConfig.ConfigBuilder configure(org.apache.hadoop.mapreduce.Job job)
Configures the Job to use the IcebergInputFormat and returns a helper to add further configuration.
Parameters:
job - the Job to configure
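As an illustration of the call described above, the following is a minimal sketch of setting up a job with configure. It assumes the Hadoop MapReduce and Iceberg MR libraries are on the classpath and that InputFormatConfig lives in the org.apache.iceberg.mr package, which this page does not state. The class name IcebergReadJobExample and the job name are hypothetical.

```java
import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;
import org.apache.iceberg.mr.InputFormatConfig; // package is an assumption
import org.apache.iceberg.mr.mapreduce.IcebergInputFormat;

// Hypothetical helper class, for illustration only.
public class IcebergReadJobExample {

  // Creates a Job and configures it to read through IcebergInputFormat.
  public static Job buildJob(Configuration conf) throws IOException {
    // The job name is illustrative.
    Job job = Job.getInstance(conf, "iceberg-read-example");

    // Configures the job to use IcebergInputFormat and returns the
    // helper for further configuration.
    InputFormatConfig.ConfigBuilder builder = IcebergInputFormat.configure(job);

    // Further settings (such as which table to read) would be applied
    // through 'builder'. Its methods are not listed on this page, so
    // none are called here.
    return job;
  }
}
```

Because configure is static and takes the Job directly, it can be called anywhere in the driver before the job is submitted.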
getSplits
public java.util.List<org.apache.hadoop.mapreduce.InputSplit> getSplits(org.apache.hadoop.mapreduce.JobContext context)
Specified by:
getSplits in class org.apache.hadoop.mapreduce.InputFormat<java.lang.Void,T>
createRecordReader
public org.apache.hadoop.mapreduce.RecordReader<java.lang.Void,T> createRecordReader(org.apache.hadoop.mapreduce.InputSplit split, org.apache.hadoop.mapreduce.TaskAttemptContext context)
Specified by:
createRecordReader in class org.apache.hadoop.mapreduce.InputFormat<java.lang.Void,T>
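Since the record reader is typed RecordReader<java.lang.Void,T>, a mapper consuming this input format declares Void as its input key type and T as its input value type. The sketch below assumes the default data model and takes T to be org.apache.iceberg.data.Record. That exact class is an assumption, as this page only says "Iceberg records". The class name RecordToTextMapper is hypothetical.

```java
import java.io.IOException;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.iceberg.data.Record; // assumed to be the default T

// Hypothetical mapper that writes each input record as a line of text.
public class RecordToTextMapper extends Mapper<Void, Record, NullWritable, Text> {
  private final Text line = new Text();

  @Override
  protected void map(Void key, Record value, Context context)
      throws IOException, InterruptedException {
    // The key type is Void, so it carries no data; only the value is used.
    line.set(String.valueOf(value));
    context.write(NullWritable.get(), line);
  }
}
```

A job would register this mapper with job.setMapperClass(RecordToTextMapper.class) alongside the input format configuration.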