Package org.apache.iceberg.mr.mapred
Class MapredIcebergInputFormat<T>
java.lang.Object
org.apache.iceberg.mr.mapred.MapredIcebergInputFormat<T>
- Type Parameters:
T
- Java class of records constructed by Iceberg; default isRecord
- Direct Known Subclasses:
HiveIcebergInputFormat
public class MapredIcebergInputFormat<T>
extends Object
implements org.apache.hadoop.mapred.InputFormat<Void,Container<T>>
Generic MR v1 InputFormat API for Iceberg.
-
Nested Class Summary
Modifier and TypeClassDescriptionstatic class
-
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionconfigure
(org.apache.hadoop.mapred.JobConf job) Configures theJobConf
to use theMapredIcebergInputFormat
and returns a helper to add further configuration.getRecordReader
(org.apache.hadoop.mapred.InputSplit split, org.apache.hadoop.mapred.JobConf job, org.apache.hadoop.mapred.Reporter reporter) org.apache.hadoop.mapred.InputSplit[]
getSplits
(org.apache.hadoop.mapred.JobConf job, int numSplits) static org.apache.hadoop.mapreduce.TaskAttemptContext
newTaskAttemptContext
(org.apache.hadoop.mapred.JobConf job, org.apache.hadoop.mapred.Reporter reporter)
-
Constructor Details
-
MapredIcebergInputFormat
public MapredIcebergInputFormat()
-
-
Method Details
-
configure
Configures theJobConf
to use theMapredIcebergInputFormat
and returns a helper to add further configuration.- Parameters:
job
- theJobConf
to configure
-
getSplits
public org.apache.hadoop.mapred.InputSplit[] getSplits(org.apache.hadoop.mapred.JobConf job, int numSplits) throws IOException - Specified by:
getSplits
in interfaceorg.apache.hadoop.mapred.InputFormat<Void,
Container<T>> - Throws:
IOException
-
getRecordReader
public org.apache.hadoop.mapred.RecordReader<Void,Container<T>> getRecordReader(org.apache.hadoop.mapred.InputSplit split, org.apache.hadoop.mapred.JobConf job, org.apache.hadoop.mapred.Reporter reporter) throws IOException - Specified by:
getRecordReader
in interfaceorg.apache.hadoop.mapred.InputFormat<Void,
Container<T>> - Throws:
IOException
-
newTaskAttemptContext
public static org.apache.hadoop.mapreduce.TaskAttemptContext newTaskAttemptContext(org.apache.hadoop.mapred.JobConf job, org.apache.hadoop.mapred.Reporter reporter)
-