Package org.apache.iceberg
Class DataTableScan
java.lang.Object
org.apache.iceberg.SnapshotScan<TableScan,FileScanTask,CombinedScanTask>
  
org.apache.iceberg.DataTableScan
- All Implemented Interfaces:
- Scan<TableScan,,- FileScanTask, - CombinedScanTask> - TableScan
- 
Field SummaryFieldsModifier and TypeFieldDescriptionprotected static final boolean
- 
Constructor SummaryConstructorsModifierConstructorDescriptionprotectedDataTableScan(Table table, Schema schema, org.apache.iceberg.TableScanContext context) 
- 
Method SummaryModifier and TypeMethodDescriptionappendsAfter(long fromSnapshotId) Create a newTableScanto read appended data fromfromSnapshotIdexclusive to the current snapshot inclusive.appendsBetween(long fromSnapshotId, long toSnapshotId) Create a newTableScanto read appended data fromfromSnapshotIdexclusive totoSnapshotIdinclusive.caseSensitive(boolean caseSensitive) Create a new scan from this that, if data columns where selected viaScan.select(java.util.Collection), controls whether the match to the schema will be done with case sensitivity.protected org.apache.iceberg.TableScanContextcontext()filter()Returns this scan's filterExpression.filter(Expression expr) Create a new scan from the results of this filtered by theExpression.Create a new scan from this that applies data filtering to files but not to rows in those files.Create a new scan from this that loads the column stats with each data file.includeColumnStats(Collection<String> requestedColumns) Create a new scan from this that loads the column stats for the specific columns with each data file.protected FileIOio()booleanReturns whether this scan is case-sensitive with respect to column names.metricsReporter(MetricsReporter reporter) Create a new scan that will report scan metrics to the provided reporter in addition to reporters maintained by the scan.protected TableScannewRefinedScan(Table table, Schema schema, org.apache.iceberg.TableScanContext context) Create a new scan from this scan's configuration that will override theTable's behavior based on the incoming pair.options()protected ExecutorServicePlan balanced task groups for this scan by splitting large and combining small tasks.planWith(ExecutorService executorService) Create a new scan to use a particular executor to plan.Create a new scan from this with the schema as its projection.protected Expressionschema()Returns this scan's projectionSchema.select(Collection<String> columns) Create a new scan from this that will read the given data columns.protected booleanprotected booleanprotected booleanintReturns the split lookback for this scan.longReturns the split open file cost for this scan.table()protected SchemalongReturns the target split size for this scan.protected booleanMethods inherited from class org.apache.iceberg.SnapshotScanasOfTime, planFiles, scanMetrics, snapshot, snapshotId, toString, useRef, useSnapshotMethods inherited from class java.lang.Objectclone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, waitMethods inherited from interface org.apache.iceberg.ScancaseSensitive, filter, filter, ignoreResiduals, includeColumnStats, includeColumnStats, isCaseSensitive, metricsReporter, option, planFiles, planWith, project, schema, select, select, splitLookback, splitOpenFileCost, targetSplitSize
- 
Field Details- 
SCAN_COLUMNS
- 
SCAN_WITH_STATS_COLUMNS
- 
DELETE_SCAN_COLUMNS
- 
DELETE_SCAN_WITH_STATS_COLUMNS
- 
PLAN_SCANS_WITH_WORKER_POOLprotected static final boolean PLAN_SCANS_WITH_WORKER_POOL
 
- 
- 
Constructor Details- 
DataTableScan
 
- 
- 
Method Details- 
appendsBetweenDescription copied from interface:TableScanCreate a newTableScanto read appended data fromfromSnapshotIdexclusive totoSnapshotIdinclusive.- Specified by:
- appendsBetweenin interface- TableScan
- Parameters:
- fromSnapshotId- the last snapshot id read by the user, exclusive
- toSnapshotId- read append data up to this snapshot id
- Returns:
- a table scan which can read append data from fromSnapshotIdexclusive and up totoSnapshotIdinclusive
 
- 
appendsAfterDescription copied from interface:TableScanCreate a newTableScanto read appended data fromfromSnapshotIdexclusive to the current snapshot inclusive.- Specified by:
- appendsAfterin interface- TableScan
- Parameters:
- fromSnapshotId- - the last snapshot id read by the user, exclusive
- Returns:
- a table scan which can read append data from fromSnapshotIdexclusive and up to current snapshot inclusive
 
- 
useSnapshotSchemaprotected boolean useSnapshotSchema()- Overrides:
- useSnapshotSchemain class- SnapshotScan<TableScan,- FileScanTask, - CombinedScanTask> 
 
- 
newRefinedScan
- 
doPlanFiles- Specified by:
- doPlanFilesin class- SnapshotScan<TableScan,- FileScanTask, - CombinedScanTask> 
 
- 
planTasksDescription copied from interface:ScanPlan balanced task groups for this scan by splitting large and combining small tasks.Task groups created by this method may read partial input files, multiple input files or both. - Specified by:
- planTasksin interface- Scan<TableScan,- FileScanTask, - CombinedScanTask> 
- Returns:
- an Iterable of balanced task groups required by this scan
 
- 
table
- 
io
- 
tableSchema
- 
contextprotected org.apache.iceberg.TableScanContext context()
- 
options
- 
scanColumns
- 
shouldReturnColumnStatsprotected boolean shouldReturnColumnStats()
- 
columnsToKeepStats
- 
shouldIgnoreResidualsprotected boolean shouldIgnoreResiduals()
- 
residualFilter
- 
shouldPlanWithExecutorprotected boolean shouldPlanWithExecutor()
- 
planExecutor
- 
optionDescription copied from interface:ScanCreate a new scan from this scan's configuration that will override theTable's behavior based on the incoming pair. Unknown properties will be ignored.- Specified by:
- optionin interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
- Parameters:
- property- name of the table property to be overridden
- value- value to override with
- Returns:
- a new scan based on this with overridden behavior
 
- 
projectDescription copied from interface:ScanCreate a new scan from this with the schema as its projection.- Specified by:
- projectin interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
- Parameters:
- projectedSchema- a projection schema
- Returns:
- a new scan based on this with the given projection
 
- 
caseSensitiveDescription copied from interface:ScanCreate a new scan from this that, if data columns where selected viaScan.select(java.util.Collection), controls whether the match to the schema will be done with case sensitivity. Default is true.- Specified by:
- caseSensitivein interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
- Returns:
- a new scan based on this with case sensitivity as stated
 
- 
isCaseSensitivepublic boolean isCaseSensitive()Description copied from interface:ScanReturns whether this scan is case-sensitive with respect to column names.- Specified by:
- isCaseSensitivein interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
- Returns:
- true if case-sensitive, false otherwise.
 
- 
includeColumnStatsDescription copied from interface:ScanCreate a new scan from this that loads the column stats with each data file.Column stats include: value count, null value count, lower bounds, and upper bounds. - Specified by:
- includeColumnStatsin interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
- Returns:
- a new scan based on this that loads column stats.
 
- 
includeColumnStatsDescription copied from interface:ScanCreate a new scan from this that loads the column stats for the specific columns with each data file.Column stats include: value count, null value count, lower bounds, and upper bounds. - Specified by:
- includeColumnStatsin interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
- Parameters:
- requestedColumns- column names for which to keep the stats.
- Returns:
- a new scan based on this that loads column stats for specific columns.
 
- 
selectDescription copied from interface:ScanCreate a new scan from this that will read the given data columns. This produces an expected schema that includes all fields that are either selected or used by this scan's filter expression.- Specified by:
- selectin interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
- Parameters:
- columns- column names from the table's schema
- Returns:
- a new scan based on this with the given projection columns
 
- 
filterDescription copied from interface:ScanCreate a new scan from the results of this filtered by theExpression.- Specified by:
- filterin interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
- Parameters:
- expr- a filter expression
- Returns:
- a new scan based on this with results filtered by the expression
 
- 
filterDescription copied from interface:ScanReturns this scan's filterExpression.- Specified by:
- filterin interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
- Returns:
- this scan's filter expression
 
- 
ignoreResidualsDescription copied from interface:ScanCreate a new scan from this that applies data filtering to files but not to rows in those files.- Specified by:
- ignoreResidualsin interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
- Returns:
- a new scan based on this that does not filter rows in files.
 
- 
planWithDescription copied from interface:ScanCreate a new scan to use a particular executor to plan. The default worker pool will be used by default.- Specified by:
- planWithin interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
- Parameters:
- executorService- the provided executor
- Returns:
- a table scan that uses the provided executor to access manifests
 
- 
schemaDescription copied from interface:ScanReturns this scan's projectionSchema.If the projection schema was set directly using Scan.project(Schema), returns that schema.If the projection schema was set by calling Scan.select(Collection), returns a projection schema that includes the selected data fields and any fields used in the filter expression.- Specified by:
- schemain interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
- Returns:
- this scan's projection schema
 
- 
targetSplitSizepublic long targetSplitSize()Description copied from interface:ScanReturns the target split size for this scan.- Specified by:
- targetSplitSizein interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
 
- 
splitLookbackpublic int splitLookback()Description copied from interface:ScanReturns the split lookback for this scan.- Specified by:
- splitLookbackin interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
 
- 
splitOpenFileCostpublic long splitOpenFileCost()Description copied from interface:ScanReturns the split open file cost for this scan.- Specified by:
- splitOpenFileCostin interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
 
- 
metricsReporterDescription copied from interface:ScanCreate a new scan that will report scan metrics to the provided reporter in addition to reporters maintained by the scan.- Specified by:
- metricsReporterin interface- Scan<ThisT,- T extends ScanTask, - G extends ScanTaskGroup<T>> 
 
 
-