Package org.apache.iceberg.io
package org.apache.iceberg.io
-
ClassDescriptionThis class acts as a helper for handling the closure of multiple resource.A convenience wrapper around
CloseableIterator
, providing auto-close functionality when all of the elements in the iterator are consumed.A data writer capable of writing to multiple specs and partitions that requires the incoming records to be properly clustered by partition spec and by partition within each spec.An equality delete writer capable of writing to multiple specs and partitions that requires the incoming delete records to be properly clustered by partition spec and by partition within each spec.A position delete writer capable of writing to multiple specs and partitions that requires the incoming delete records to be properly clustered by partition spec and by partition within each spec.Class that provides file-content caching during reading.Interface used to expose credentials held by a FileIO instance.DataWriter<T>A result of writing data files.This interface is intended as an extension for FileIO implementations that support being a delegate target.A result of writing delete files.A writer capable of writing data and equality deletes that may belong to different specs and partitions.A data writer capable of writing to multiple specs and partitions that keeps data writers for each seen spec/partition pair open until this writer is closed.A position delete writer capable of writing to multiple specs and partitions if the incoming stream of deletes is not ordered.FileAppender<D>Factory to create a newFileAppender
to write records.Pluggable module for reading, writing, and deleting files.Extension of MetricsContext for use with FileIO to define standard metrics that should be reported.FileWriter<T,R> A writer capable of writing files of a single type (i.e. data/delete) to one spec/partition.A factory for creating data and delete writers.An Iterator that filters another Iterator.An interface used to read input files usingSeekableInputStream
instances.Interface for providing data file locations to write tasks.An interface used to create output files usingPositionOutputStream
instances.Factory responsible for generating unique but recognizable data/delete file names.PartitioningWriter<T,R> A writer capable of writing files of a single type (i.e. data/delete) to multiple specs and partitions.A writer capable of writing data and position deletes that may belong to different specs and partitions.RangeReadable
is an interface that allows for implementations ofInputFile
streams to perform positional, range-based reads, which are more efficient than unbounded reads in many cloud provider object stores.FileIO implementation that uses location scheme to choose the correct FileIO implementation.A rolling data writer that splits incoming data into multiple files within one spec/partition based on the target file size.A rolling equality delete writer that splits incoming deletes into multiple files within one spec/partition based on the target file size.A rolling position delete writer that splits incoming deletes into multiple files within one spec/partition based on the target file size.SeekableInputStream
is an interface with the methods needed to read data from a file or Hadoop data stream.This interface is intended as an extension for FileIO implementations to provide additional prefix based operations that may be useful in performing supporting operations.TaskWriter<T>The writer interface could accept records and provide the generated data files.