Package org.apache.iceberg
package org.apache.iceberg
Class    Description

Accessor<T>  Position2Accessor and Position3Accessor here are an optimization.
A scan task for inserts generated by adding a data file to the table.
A Table implementation that exposes a table's valid data files as rows.
A Table implementation that exposes its valid delete files as rows.
A Table implementation that exposes a table's manifest entries as rows, for both delete and data files.
A Table implementation that exposes its valid files as rows.
A Table implementation that exposes a table's valid manifest files as rows.
API for appending new files in a table.
Base class for metadata tables.
BaseScanTaskGroup<T extends ScanTask>
Base Table implementation.
API for configuring a batch scan.
Metadata about a statistics or indices blob.
Class that wraps an Iceberg Catalog to cache tables.
An enum representing possible operations in a changelog.
A changelog scan task.
ClientPool<C, E extends Exception>
ClientPool.Action<R, C, E extends Exception>
ClientPoolImpl<C, E extends Exception>
A scan task made of several ranges from files.
ContentFile<F>  Superinterface of DataFile and DeleteFile that exposes common methods.
ContentScanTask<F extends ContentFile<F>>  A scan task over a range of bytes in a content file.
Interface for data files listed in a table manifest.
A Table implementation that exposes a table's data files as rows.
Data operations that produce snapshots.
A task that returns data as rows instead of where to read data.
A scan task for deletes generated by removing a data file from the table.
A scan task for deletes generated by adding delete files to the table.
Interface for delete files listed in a table delete manifest.
API for deleting files from a table.
A Table implementation that exposes a table's delete files as rows.
Enum of supported write distribution modes; it defines the write behavior of batch or streaming jobs.
Iceberg internally tracked field level metrics, used by Parquet and ORC writers only.
API for removing old snapshots from a table.
FieldMetrics<T>  Iceberg internally tracked field level metrics.
Content type stored in a file, one of DATA, POSITION_DELETES, or EQUALITY_DELETES.
Enum of supported file formats.
A scan task over a range of bytes in a single data file.
A Table implementation that exposes a table's files as rows.
Iceberg internally tracked field level metrics, used by Parquet and ORC writers only.
Used to expose a table's TableOperations.
Table history entry.
A Table implementation that exposes a table's history as rows.
Loads iceberg-version.properties with build information.
API for configuring an incremental table scan for appends-only snapshots.
API for configuring a scan for table changes.
IncrementalScan<ThisT, T extends ScanTask, G extends ScanTaskGroup<T>>  API for configuring an incremental scan.
An isolation level in a table.
An interface for locking, used to ensure commit isolation.
API for managing snapshots.
Content type stored in a manifest file, either DATA or DELETES.
A Table implementation that exposes a table's manifest entries as rows, for both delete and data files.
Represents a manifest file that can be scanned to find files in a table.
Summarizes the values of one partition field stored in a manifest file.
ManifestReader<F extends ContentFile<F>>  Base reader for data and delete manifest files.
A Table implementation that exposes a table's manifest files as rows.
ManifestWriter<F extends ContentFile<F>>  Writer for manifest files.
MergeableScanTask<ThisT>  A scan task that can be potentially merged with other scan tasks.
Represents a change to table or view metadata.
Iceberg file format metrics.
This class defines different metrics modes, which allow users to control the collection of value_counts, null_value_counts, nan_value_counts, lower_bounds, upper_bounds for different columns in metadata.
Under this mode, only value_counts, null_value_counts, nan_value_counts are persisted.
Under this mode, value_counts, null_value_counts, nan_value_counts and full lower_bounds, upper_bounds are persisted.
A metrics calculation mode.
Under this mode, value_counts, null_value_counts, nan_value_counts, lower_bounds, upper_bounds are not persisted.
Under this mode, value_counts, null_value_counts, nan_value_counts and truncated lower_bounds, upper_bounds are persisted.
A struct of readable metric values for a primitive column.
Fixed definition of a readable metric column, i.e. a mapping of a raw metric to a readable metric.
A struct consisting of all MetricsUtil.ReadableColMetricsStruct for all primitive columns of the table.
API for overwriting files in a table.
Represents a single field in a PartitionSpec.
A struct of partition values.
A scan task for data within a particular partition.
Represents how to produce partition data for a table.
Used to create valid partition specs.
A Table implementation that exposes a table's partitions as rows.
Represents a partition statistics file that can be used to read table data more efficiently.
Computes, writes and reads the PartitionStatisticsFile.
Deprecated. Use PartitionStatsHandler directly.
API for table metadata changes.
A ScanTask for position delete files.
A Table implementation whose Scan provides PositionDeletesScanTask, for reading of position delete files.
A Table implementation that exposes a table's known snapshot references as rows.
API for overwriting files in a table by partition.
API for replacing table sort order with a newly created order.
API for replacing files in a table.
Enum of supported rewrite job order; it defines the order in which the file groups should be written.
API for rewriting manifests for a table.
Utilities for the rewrite table path action.
Class providing engine-specific methods to read and write position delete files.
Rewrite result.
RollingManifestWriter<F extends ContentFile<F>>  As opposed to ManifestWriter, a rolling writer could produce multiple manifest files.
API for encoding row-level changes to a table.
Iceberg supports two ways to modify records in a table: copy-on-write and merge-on-read.
Scan<ThisT, T extends ScanTask, G extends ScanTaskGroup<T>>  Scan objects are immutable and can be shared between threads.
A scan task.
ScanTaskGroup<T extends ScanTask>  A scan task that may include partial input files, multiple input files or both.
The schema of a data table.
A read-only serializable table that can be sent to other nodes in a cluster.
A snapshot of the data in a table at a point in time.
SnapshotScan<ThisT, T extends ScanTask, G extends ScanTaskGroup<T>>  A common base class to share code between different BaseScan implementations that handle scans of a particular snapshot.
A Table implementation that exposes a table's known snapshots as rows.
SnapshotUpdate<ThisT>  API for table changes that produce snapshots.
A field in a SortOrder.
A struct of flattened sort field values.
A sort order that defines how data and delete files should be ordered in a table.
A builder used to create valid sort orders.
Methods for building a sort order.
A batch data scan that can utilize Spark cluster resources for planning.
SplittableScanTask<ThisT>  A scan task that can be split into smaller scan tasks.
TableOperations implementation that provides access to metadata for a Table at some point in time, using a table metadata location.
Represents a statistics file in the Puffin format that can be used to read table data more efficiently.
Delete implementation that avoids loading full manifests in memory.
Interface for accessing data by position in a schema.
Configuration properties that are controlled by Java system properties or environment variables.
Deprecated. Use SystemConfigs instead; will be removed in 2.0.0.
Represents a table.
Metadata for a table.
SPI interface to abstract table metadata access and updates.
Generic interface for creating and loading a table implementation.
API for configuring a table scan.
A transaction for performing multiple updates to a table.
API for setting a table's or view's base location.
API for partition spec evolution.
API for updating partition statistics files in a table.
API for updating table properties.
Represents a requirement for a MetadataUpdate.
API for schema evolution.
API for updating statistics files in a table.
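
To show how several of these interfaces fit together, the following is a minimal sketch rather than an excerpt from the Iceberg documentation. It assumes the HadoopTables implementation of Tables from org.apache.iceberg.hadoop, a hypothetical warehouse location under /tmp/warehouse, and a placeholder data file whose path, size, and record count are illustrative values only; it exercises Schema, PartitionSpec, Table, AppendFiles, Snapshot, TableScan, and FileScanTask from the listing above.

import org.apache.hadoop.conf.Configuration;
import org.apache.iceberg.AppendFiles;
import org.apache.iceberg.DataFile;
import org.apache.iceberg.DataFiles;
import org.apache.iceberg.FileFormat;
import org.apache.iceberg.FileScanTask;
import org.apache.iceberg.PartitionSpec;
import org.apache.iceberg.Schema;
import org.apache.iceberg.Snapshot;
import org.apache.iceberg.Table;
import org.apache.iceberg.TableScan;
import org.apache.iceberg.expressions.Expressions;
import org.apache.iceberg.hadoop.HadoopTables;
import org.apache.iceberg.io.CloseableIterable;
import org.apache.iceberg.types.Types;

public class PackageTour {  // hypothetical class name, for illustration only
  public static void main(String[] args) throws Exception {
    // Schema and PartitionSpec describe the table layout.
    Schema schema = new Schema(
        Types.NestedField.required(1, "id", Types.LongType.get()),
        Types.NestedField.optional(2, "category", Types.StringType.get()));
    PartitionSpec spec = PartitionSpec.builderFor(schema)
        .identity("category")
        .build();

    // HadoopTables is one implementation of the Tables interface; the location is hypothetical.
    Table table = new HadoopTables(new Configuration())
        .create(schema, spec, "/tmp/warehouse/events");

    // AppendFiles adds new data files to the table; the file path, size, and record
    // count below are placeholders that would normally come from a file writer.
    DataFile dataFile = DataFiles.builder(spec)
        .withPath("/tmp/warehouse/events/data/category=a/file-000.parquet")
        .withFormat(FileFormat.PARQUET)
        .withPartitionPath("category=a")
        .withFileSizeInBytes(1024L)
        .withRecordCount(100L)
        .build();
    AppendFiles append = table.newAppend();
    append.appendFile(dataFile);
    append.commit();  // the commit produces a new Snapshot

    Snapshot current = table.currentSnapshot();
    System.out.println("snapshot operation: " + current.operation());

    // TableScan plans FileScanTasks for the files matching a row filter.
    TableScan scan = table.newScan().filter(Expressions.equal("category", "a"));
    try (CloseableIterable<FileScanTask> tasks = scan.planFiles()) {
      for (FileScanTask task : tasks) {
        System.out.println(task.file().path() + " -> " + task.length() + " bytes");
      }
    }
  }
}

Note that AppendFiles.commit() produces a new Snapshot, and because Scan objects are immutable, filter(...) returns a refined copy of the scan rather than mutating the original.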