Package org.apache.iceberg.io
Class  Description
This class acts as a helper for handling the closure of multiple resources.
A convenience wrapper around CloseableIterator, providing auto-close functionality when all of the elements in the iterator are consumed.
A data writer capable of writing to multiple specs and partitions that requires the incoming records to be properly clustered by partition spec and by partition within each spec.
An equality delete writer capable of writing to multiple specs and partitions that requires the incoming delete records to be properly clustered by partition spec and by partition within each spec.
A position delete writer capable of writing to multiple specs and partitions that requires the incoming delete records to be properly clustered by partition spec and by partition within each spec.
Class that provides file-content caching during reading.
Interface used to expose credentials held by a FileIO instance.
DataWriter<T>
A result of writing data files.
This interface is intended as an extension for FileIO implementations that support being a delegate target.
A result of writing delete files.
A writer capable of writing data and equality deletes that may belong to different specs and partitions.
A data writer capable of writing to multiple specs and partitions that keeps data writers for each seen spec/partition pair open until this writer is closed.
A position delete writer capable of writing to multiple specs and partitions if the incoming stream of deletes is not ordered.
FileAppender<D>
Factory to create a new FileAppender to write records.
Pluggable module for reading, writing, and deleting files.
Extension of MetricsContext for use with FileIO to define standard metrics that should be reported.
Keeps track of the FileIO instance of the given TableOperations instance and closes the FileIO when FileIOTracker.close() gets called.
FileWriter<T,R>  A writer capable of writing files of a single type (i.e.
A factory for creating data and delete writers.
An Iterator that filters another Iterator.
An interface used to read input files using SeekableInputStream instances.
Interface for providing data file locations to write tasks.
An interface used to create output files using PositionOutputStream instances.
Factory responsible for generating unique but recognizable data/delete file names.
PartitioningDVWriter is a PartitioningWriter implementation that accumulates deleted positions for data files across different partitions and writes out deletion vector files.
PartitioningWriter<T,R>  A writer capable of writing files of a single type (i.e.
A writer capable of writing data and position deletes that may belong to different specs and partitions.
RangeReadable is an interface that allows for implementations of InputFile streams to perform positional, range-based reads, which are more efficient than unbounded reads in many cloud provider object stores.
FileIO implementation that uses location scheme to choose the correct FileIO implementation.
A rolling data writer that splits incoming data into multiple files within one spec/partition based on the target file size.
A rolling equality delete writer that splits incoming deletes into multiple files within one spec/partition based on the target file size.
A rolling position delete writer that splits incoming deletes into multiple files within one spec/partition based on the target file size.
SeekableInputStream is an interface with the methods needed to read data from a file or Hadoop data stream.
This interface is intended as an extension for FileIO implementations to provide additional prefix based operations that may be useful in performing supporting operations.
This interface is intended as an extension for FileIO implementations to provide additional best-effort recovery operations that can be useful for repairing corrupted tables where there are reachable files missing from disk.
TaskWriter<T>  The writer interface could accept records and provide the generated data files.
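One of the entries above describes a wrapper that closes itself once every element of the underlying iterator has been consumed. The following is a minimal sketch of that idea using only the Java standard library. It is not the Iceberg implementation: the class name AutoClosingIteratorSketch, the onClose callback, and the isClosed() helper are hypothetical names introduced for illustration.

```java
import java.io.Closeable;
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;

// Hypothetical illustration only; names and structure are assumptions,
// not the API of org.apache.iceberg.io.
final class AutoClosingIteratorSketch<T> implements Iterator<T>, Closeable {
  private final Iterator<T> delegate;
  private final Runnable onClose; // stand-in for releasing the underlying resource
  private boolean closed = false;

  AutoClosingIteratorSketch(Iterator<T> delegate, Runnable onClose) {
    this.delegate = delegate;
    this.onClose = onClose;
  }

  @Override
  public boolean hasNext() {
    if (closed) {
      return false;
    }
    boolean more = delegate.hasNext();
    if (!more) {
      close(); // auto-close as soon as the delegate is exhausted
    }
    return more;
  }

  @Override
  public T next() {
    if (!hasNext()) {
      throw new NoSuchElementException();
    }
    return delegate.next();
  }

  @Override
  public void close() {
    // Idempotent: the callback runs at most once, even if close() is called again.
    if (!closed) {
      closed = true;
      onClose.run();
    }
  }

  boolean isClosed() {
    return closed;
  }

  public static void main(String[] args) {
    AutoClosingIteratorSketch<String> it =
        new AutoClosingIteratorSketch<>(
            List.of("a", "b").iterator(), () -> System.out.println("resource released"));
    while (it.hasNext()) {
      System.out.println(it.next());
    }
    System.out.println("closed after exhaustion: " + it.isClosed());
  }
}
```

In this sketch the caller may still call close() explicitly, for example when iteration stops early. The auto-close path only covers the case where the iterator is fully drained.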