Class DeleteFilesTable.DeleteFilesTableScan
- java.lang.Object
-
- org.apache.iceberg.DeleteFilesTable.DeleteFilesTableScan
-
- All Implemented Interfaces:
Scan<TableScan,FileScanTask,CombinedScanTask>
,TableScan
- Enclosing class:
- DeleteFilesTable
public static class DeleteFilesTable.DeleteFilesTableScan extends java.lang.Object
-
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description TableScan
appendsAfter(long fromSnapshotId)
Create a newTableScan
to read appended data fromfromSnapshotId
exclusive to the current snapshot inclusive.TableScan
appendsBetween(long fromSnapshotId, long toSnapshotId)
Create a newTableScan
to read appended data fromfromSnapshotId
exclusive totoSnapshotId
inclusive.TableScan
asOfTime(long timestampMillis)
Create a newTableScan
from this scan's configuration that will use the most recent snapshot as of the given time in milliseconds.ThisT
caseSensitive(boolean caseSensitive)
Create a new scan from this that, if data columns where selected viaScan.select(java.util.Collection)
, controls whether the match to the schema will be done with case sensitivity.protected boolean
colStats()
protected org.apache.iceberg.TableScanContext
context()
protected CloseableIterable<FileScanTask>
doPlanFiles()
Expression
filter()
Returns this scan's filterExpression
.ThisT
filter(Expression expr)
Create a new scan from the results of this filtered by theExpression
.ThisT
ignoreResiduals()
Create a new scan from this that applies data filtering to files but not to rows in those files.ThisT
includeColumnStats()
Create a new scan from this that loads the column stats with each data file.boolean
isCaseSensitive()
Returns whether this scan should apply column name case sensitiveness as perScan.caseSensitive(boolean)
.protected CloseableIterable<ManifestFile>
manifests()
Returns an iterable of manifest files to explore for this files metadata table scanprotected TableScan
newRefinedScan(TableOperations ops, Table table, Schema schema, org.apache.iceberg.TableScanContext context)
ThisT
option(java.lang.String property, java.lang.String value)
Create a new scan from this scan's configuration that will override theTable
's behavior based on the incoming pair.protected java.util.Map<java.lang.String,java.lang.String>
options()
protected java.util.concurrent.ExecutorService
planExecutor()
CloseableIterable<FileScanTask>
planFiles()
Plan tasks for this scan where each task reads a single file.CloseableIterable<CombinedScanTask>
planTasks()
Plan balanced task groups for this scan by splitting large and combining small tasks.ThisT
planWith(java.util.concurrent.ExecutorService executorService)
Create a new scan to use a particular executor to plan.ThisT
project(Schema projectedSchema)
Create a new scan from this with the schema as its projection.Schema
schema()
Returns this scan's projectionSchema
.ThisT
select(java.util.Collection<java.lang.String> columns)
Create a new scan from this that will read the given data columns.protected boolean
shouldIgnoreResiduals()
Snapshot
snapshot()
Returns theSnapshot
that will be used by this scan.protected java.lang.Long
snapshotId()
int
splitLookback()
Returns the split lookback for this scan.long
splitOpenFileCost()
Returns the split open file cost for this scan.Table
table()
Returns theTable
from which this scan loads data.protected TableOperations
tableOps()
protected Schema
tableSchema()
protected MetadataTableType
tableType()
Type of scan being performed, such asMetadataTableType.ALL_DATA_FILES
when scanning a table'sAllDataFilesTable
.long
targetSplitSize()
Returns the target split size for this scan.java.lang.String
toString()
TableScan
useSnapshot(long scanSnapshotId)
Create a newTableScan
from this scan's configuration that will use the given snapshot by ID.-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
-
Methods inherited from interface org.apache.iceberg.Scan
caseSensitive, filter, ignoreResiduals, includeColumnStats, option, planWith, project, schema, select, splitLookback, splitOpenFileCost
-
-
-
-
Method Detail
-
newRefinedScan
protected TableScan newRefinedScan(TableOperations ops, Table table, Schema schema, org.apache.iceberg.TableScanContext context)
-
manifests
protected CloseableIterable<ManifestFile> manifests()
Returns an iterable of manifest files to explore for this files metadata table scan
-
doPlanFiles
protected CloseableIterable<FileScanTask> doPlanFiles()
-
tableType
protected MetadataTableType tableType()
Type of scan being performed, such asMetadataTableType.ALL_DATA_FILES
when scanning a table'sAllDataFilesTable
.Used for logging and error messages.
-
appendsBetween
public TableScan appendsBetween(long fromSnapshotId, long toSnapshotId)
Description copied from interface:TableScan
Create a newTableScan
to read appended data fromfromSnapshotId
exclusive totoSnapshotId
inclusive.- Specified by:
appendsBetween
in interfaceTableScan
- Parameters:
fromSnapshotId
- the last snapshot id read by the user, exclusivetoSnapshotId
- read append data up to this snapshot id- Returns:
- a table scan which can read append data from
fromSnapshotId
exclusive and up totoSnapshotId
inclusive
-
appendsAfter
public TableScan appendsAfter(long fromSnapshotId)
Description copied from interface:TableScan
Create a newTableScan
to read appended data fromfromSnapshotId
exclusive to the current snapshot inclusive.- Specified by:
appendsAfter
in interfaceTableScan
- Parameters:
fromSnapshotId
- - the last snapshot id read by the user, exclusive- Returns:
- a table scan which can read append data from
fromSnapshotId
exclusive and up to current snapshot inclusive
-
targetSplitSize
public long targetSplitSize()
Description copied from interface:Scan
Returns the target split size for this scan.- Specified by:
targetSplitSize
in interfaceScan<TableScan,FileScanTask,CombinedScanTask>
-
snapshotId
protected java.lang.Long snapshotId()
-
colStats
protected boolean colStats()
-
shouldIgnoreResiduals
protected boolean shouldIgnoreResiduals()
-
planExecutor
protected java.util.concurrent.ExecutorService planExecutor()
-
options
protected java.util.Map<java.lang.String,java.lang.String> options()
-
table
public Table table()
Description copied from interface:TableScan
Returns theTable
from which this scan loads data.
-
useSnapshot
public TableScan useSnapshot(long scanSnapshotId)
Description copied from interface:TableScan
Create a newTableScan
from this scan's configuration that will use the given snapshot by ID.- Specified by:
useSnapshot
in interfaceTableScan
- Parameters:
scanSnapshotId
- a snapshot ID- Returns:
- a new scan based on this with the given snapshot ID
-
asOfTime
public TableScan asOfTime(long timestampMillis)
Description copied from interface:TableScan
Create a newTableScan
from this scan's configuration that will use the most recent snapshot as of the given time in milliseconds.
-
filter
public Expression filter()
Description copied from interface:TableScan
Returns this scan's filterExpression
.
-
planFiles
public CloseableIterable<FileScanTask> planFiles()
Description copied from interface:Scan
Plan tasks for this scan where each task reads a single file.Use
Scan.planTasks()
for planning balanced tasks where each task will read either a single file, a part of a file, or multiple files.- Specified by:
planFiles
in interfaceScan<TableScan,FileScanTask,CombinedScanTask>
- Returns:
- an Iterable of tasks scanning entire files required by this scan
-
planTasks
public CloseableIterable<CombinedScanTask> planTasks()
Description copied from interface:Scan
Plan balanced task groups for this scan by splitting large and combining small tasks.Task groups created by this method may read partial input files, multiple input files or both.
- Specified by:
planTasks
in interfaceScan<TableScan,FileScanTask,CombinedScanTask>
- Returns:
- an Iterable of balanced task groups required by this scan
-
snapshot
public Snapshot snapshot()
Description copied from interface:TableScan
Returns theSnapshot
that will be used by this scan.If the snapshot was not configured using
TableScan.asOfTime(long)
orTableScan.useSnapshot(long)
, the current table snapshot will be used.
-
isCaseSensitive
public boolean isCaseSensitive()
Description copied from interface:TableScan
Returns whether this scan should apply column name case sensitiveness as perScan.caseSensitive(boolean)
.- Specified by:
isCaseSensitive
in interfaceTableScan
- Returns:
- true if case sensitive, false otherwise.
-
toString
public java.lang.String toString()
- Overrides:
toString
in classjava.lang.Object
-
tableOps
protected TableOperations tableOps()
-
tableSchema
protected Schema tableSchema()
-
context
protected org.apache.iceberg.TableScanContext context()
-
option
public ThisT option(java.lang.String property, java.lang.String value)
Description copied from interface:Scan
Create a new scan from this scan's configuration that will override theTable
's behavior based on the incoming pair. Unknown properties will be ignored.- Specified by:
option
in interfaceScan<ThisT,T extends ScanTask,G extends ScanTaskGroup<T>>
- Parameters:
property
- name of the table property to be overriddenvalue
- value to override with- Returns:
- a new scan based on this with overridden behavior
-
project
public ThisT project(Schema projectedSchema)
Description copied from interface:Scan
Create a new scan from this with the schema as its projection.- Specified by:
project
in interfaceScan<ThisT,T extends ScanTask,G extends ScanTaskGroup<T>>
- Parameters:
projectedSchema
- a projection schema- Returns:
- a new scan based on this with the given projection
-
caseSensitive
public ThisT caseSensitive(boolean caseSensitive)
Description copied from interface:Scan
Create a new scan from this that, if data columns where selected viaScan.select(java.util.Collection)
, controls whether the match to the schema will be done with case sensitivity. Default is true.- Specified by:
caseSensitive
in interfaceScan<ThisT,T extends ScanTask,G extends ScanTaskGroup<T>>
- Returns:
- a new scan based on this with case sensitivity as stated
-
includeColumnStats
public ThisT includeColumnStats()
Description copied from interface:Scan
Create a new scan from this that loads the column stats with each data file.Column stats include: value count, null value count, lower bounds, and upper bounds.
- Specified by:
includeColumnStats
in interfaceScan<ThisT,T extends ScanTask,G extends ScanTaskGroup<T>>
- Returns:
- a new scan based on this that loads column stats.
-
select
public ThisT select(java.util.Collection<java.lang.String> columns)
Description copied from interface:Scan
Create a new scan from this that will read the given data columns. This produces an expected schema that includes all fields that are either selected or used by this scan's filter expression.- Specified by:
select
in interfaceScan<ThisT,T extends ScanTask,G extends ScanTaskGroup<T>>
- Parameters:
columns
- column names from the table's schema- Returns:
- a new scan based on this with the given projection columns
-
filter
public ThisT filter(Expression expr)
Description copied from interface:Scan
Create a new scan from the results of this filtered by theExpression
.- Specified by:
filter
in interfaceScan<ThisT,T extends ScanTask,G extends ScanTaskGroup<T>>
- Parameters:
expr
- a filter expression- Returns:
- a new scan based on this with results filtered by the expression
-
ignoreResiduals
public ThisT ignoreResiduals()
Description copied from interface:Scan
Create a new scan from this that applies data filtering to files but not to rows in those files.- Specified by:
ignoreResiduals
in interfaceScan<ThisT,T extends ScanTask,G extends ScanTaskGroup<T>>
- Returns:
- a new scan based on this that does not filter rows in files.
-
planWith
public ThisT planWith(java.util.concurrent.ExecutorService executorService)
Description copied from interface:Scan
Create a new scan to use a particular executor to plan. The default worker pool will be used by default.- Specified by:
planWith
in interfaceScan<ThisT,T extends ScanTask,G extends ScanTaskGroup<T>>
- Parameters:
executorService
- the provided executor- Returns:
- a table scan that uses the provided executor to access manifests
-
schema
public Schema schema()
Description copied from interface:Scan
Returns this scan's projectionSchema
.If the projection schema was set directly using
Scan.project(Schema)
, returns that schema.If the projection schema was set by calling
Scan.select(Collection)
, returns a projection schema that includes the selected data fields and any fields used in the filter expression.- Specified by:
schema
in interfaceScan<ThisT,T extends ScanTask,G extends ScanTaskGroup<T>>
- Returns:
- this scan's projection schema
-
splitLookback
public int splitLookback()
Description copied from interface:Scan
Returns the split lookback for this scan.- Specified by:
splitLookback
in interfaceScan<ThisT,T extends ScanTask,G extends ScanTaskGroup<T>>
-
splitOpenFileCost
public long splitOpenFileCost()
Description copied from interface:Scan
Returns the split open file cost for this scan.- Specified by:
splitOpenFileCost
in interfaceScan<ThisT,T extends ScanTask,G extends ScanTaskGroup<T>>
-
-