Package org.apache.iceberg.mr.mapreduce
Class IcebergInputFormat<T>
java.lang.Object
org.apache.hadoop.mapreduce.InputFormat<Void,T>
org.apache.iceberg.mr.mapreduce.IcebergInputFormat<T>
- Type Parameters:
T
- T is the in memory data model which can either be Pig tuples, Hive rows. Default is Iceberg records
Generic Mrv2 InputFormat API for Iceberg.
-
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionconfigure
(org.apache.hadoop.mapreduce.Job job) Configures theJob
to use theIcebergInputFormat
and returns a helper to add further configuration.createRecordReader
(org.apache.hadoop.mapreduce.InputSplit split, org.apache.hadoop.mapreduce.TaskAttemptContext context) List
<org.apache.hadoop.mapreduce.InputSplit> getSplits
(org.apache.hadoop.mapreduce.JobContext context)
-
Constructor Details
-
IcebergInputFormat
public IcebergInputFormat()
-
-
Method Details
-
configure
Configures theJob
to use theIcebergInputFormat
and returns a helper to add further configuration.- Parameters:
job
- theJob
to configure
-
getSplits
public List<org.apache.hadoop.mapreduce.InputSplit> getSplits(org.apache.hadoop.mapreduce.JobContext context) -
createRecordReader
-