public interface DeletedRowsScanTask extends ChangelogScanTask, ContentScanTask<DataFile>
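Suppose snapshot S1 contains data files F1, F2, and F3. Then snapshot S2 adds a position delete file, D1, that deletes records from F2, and snapshot S3 adds an equality delete file, D2, that removes records from F1, F2, and F3. A scan for changes from S2 to S3 (inclusive) should include the following tasks:

- DeletedRowsScanTask(file=F2, added-deletes=[D1], existing-deletes=[], snapshot=S2)
- DeletedRowsScanTask(file=F1, added-deletes=[D2], existing-deletes=[], snapshot=S3)
- DeletedRowsScanTask(file=F2, added-deletes=[D2], existing-deletes=[D1], snapshot=S3)
- DeletedRowsScanTask(file=F3, added-deletes=[D2], existing-deletes=[], snapshot=S3)
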
Readers consuming these tasks should produce deleted records with metadata like change ordinal and commit snapshot ID.
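
The S1 to S3 scenario can be worked through mechanically. The following is a minimal sketch, not Iceberg's planner: `Task`, `plan`, and `example` are hypothetical names, files are modeled as plain strings, and the change ordinal is assumed to start at 0 for the first scanned snapshot. It only illustrates how delete files added in an earlier snapshot become existing deletes for later ones.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class DeletedRowsPlanningSketch {

  /** Hypothetical stand-in for a planned task; not the Iceberg interface itself. */
  public static final class Task {
    public final String file;
    public final List<String> addedDeletes;
    public final List<String> existingDeletes;
    public final String snapshot;
    public final int changeOrdinal; // assumed to start at 0 for the first scanned snapshot

    Task(String file, List<String> added, List<String> existing, String snapshot, int ordinal) {
      this.file = file;
      this.addedDeletes = added;
      this.existingDeletes = existing;
      this.snapshot = snapshot;
      this.changeOrdinal = ordinal;
    }

    @Override
    public String toString() {
      return "DeletedRowsScanTask(file=" + file + ", added-deletes=" + addedDeletes
          + ", existing-deletes=" + existingDeletes + ", snapshot=" + snapshot + ")";
    }
  }

  /**
   * Plans one task per (snapshot, affected data file).
   * The input maps snapshot -> (new delete file -> data files it touches), in commit order.
   */
  public static List<Task> plan(Map<String, Map<String, List<String>>> deletesBySnapshot) {
    // data file -> delete files added by earlier snapshots in the scanned range
    Map<String, List<String>> earlier = new TreeMap<>();
    List<Task> tasks = new ArrayList<>();
    int ordinal = 0;
    for (Map.Entry<String, Map<String, List<String>>> snapshot : deletesBySnapshot.entrySet()) {
      // Group the delete files added by this snapshot by the data file they affect.
      Map<String, List<String>> addedPerFile = new TreeMap<>();
      for (Map.Entry<String, List<String>> delete : snapshot.getValue().entrySet()) {
        for (String dataFile : delete.getValue()) {
          addedPerFile.computeIfAbsent(dataFile, k -> new ArrayList<>()).add(delete.getKey());
        }
      }
      for (Map.Entry<String, List<String>> entry : addedPerFile.entrySet()) {
        List<String> existing = new ArrayList<>(earlier.getOrDefault(entry.getKey(), List.of()));
        tasks.add(new Task(entry.getKey(), entry.getValue(), existing, snapshot.getKey(), ordinal));
      }
      // Deletes added here count as existing deletes for every later snapshot.
      for (Map.Entry<String, List<String>> entry : addedPerFile.entrySet()) {
        earlier.computeIfAbsent(entry.getKey(), k -> new ArrayList<>()).addAll(entry.getValue());
      }
      ordinal++;
    }
    return tasks;
  }

  /** The S2..S3 scenario from the description: D1 touches F2, then D2 touches F1, F2, F3. */
  public static List<Task> example() {
    Map<String, Map<String, List<String>>> deletes = new LinkedHashMap<>();
    deletes.put("S2", Map.of("D1", List.of("F2")));
    deletes.put("S3", Map.of("D2", List.of("F1", "F2", "F3")));
    return plan(deletes);
  }

  public static void main(String[] args) {
    example().forEach(System.out::println);
  }
}
```

Running `main` yields four tasks. The only one with a non-empty existing-deletes list is the S3 task for F2, because D1 was already applied to F2 in S2.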
Modifier and Type | Method and Description |
---|---|
java.util.List<DeleteFile> | addedDeletes(): A list of added delete files that apply to the task's data file. |
java.util.List<DeleteFile> | existingDeletes(): A list of delete files that existed before and must be applied prior to determining which records are deleted by delete files in addedDeletes(). |
default int | filesCount(): The number of files that will be opened by this scan task. |
default ChangelogOperation | operation(): Returns the type of changes produced by this task. |
default long | sizeBytes(): The number of bytes that should be read by this scan task. |
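Methods inherited from interface ChangelogScanTask: changeOrdinal, commitSnapshotId
Methods inherited from interface ContentScanTask: estimatedRowsCount, file, length, partition, residual, start
Methods inherited from interface PartitionScanTask: spec
Methods inherited from interface ScanTask: asCombinedScanTask, asDataTask, asFileScanTask, isDataTask, isFileScanTask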
java.util.List<DeleteFile> addedDeletes()
A list of added delete files that apply to the task's data file. Records removed by these delete files should appear as deletes in the changelog.

java.util.List<DeleteFile> existingDeletes()
A list of delete files that existed before and must be applied prior to determining which records are deleted by delete files in addedDeletes(). Records removed by these delete files should not appear in the changelog.

default ChangelogOperation operation()
Description copied from interface: ChangelogScanTask
Returns the type of changes produced by this task.
Specified by: operation in interface ChangelogScanTask
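
The contract of addedDeletes() and existingDeletes() implies an order of application for readers: filter out rows already removed by existing deletes, then emit only the rows that the added deletes remove. The sketch below illustrates that order under simplifying assumptions. Each delete file is modeled as a set of row keys it removes, which glosses over the difference between position and equality deletes. `DeletedRecord`, `read`, and `example` are hypothetical names, not part of the Iceberg API.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

public class DeletedRowsReaderSketch {

  /** Hypothetical changelog row: a deleted record plus the task's metadata. */
  public static final class DeletedRecord {
    public final String row;
    public final int changeOrdinal;
    public final long commitSnapshotId;

    DeletedRecord(String row, int changeOrdinal, long commitSnapshotId) {
      this.row = row;
      this.changeOrdinal = changeOrdinal;
      this.commitSnapshotId = commitSnapshotId;
    }

    @Override
    public String toString() {
      return "DELETE " + row + " (ordinal=" + changeOrdinal + ", snapshot=" + commitSnapshotId + ")";
    }
  }

  /** True if any of the given delete files (modeled as sets of row keys) removes the row. */
  private static boolean removedBy(List<Set<String>> deleteFiles, String row) {
    for (Set<String> deleteFile : deleteFiles) {
      if (deleteFile.contains(row)) {
        return true;
      }
    }
    return false;
  }

  /** Emits the rows of one data file that the added deletes remove. */
  public static List<DeletedRecord> read(
      List<String> dataFileRows,
      List<Set<String>> existingDeletes,
      List<Set<String>> addedDeletes,
      int changeOrdinal,
      long commitSnapshotId) {
    List<DeletedRecord> changelog = new ArrayList<>();
    for (String row : dataFileRows) {
      // Rows removed by existing deletes were gone before this snapshot: not a change.
      if (removedBy(existingDeletes, row)) {
        continue;
      }
      // Rows that were still live and are removed by added deletes are changelog deletes.
      if (removedBy(addedDeletes, row)) {
        changelog.add(new DeletedRecord(row, changeOrdinal, commitSnapshotId));
      }
    }
    return changelog;
  }

  /** One task: a data file with rows a..d, one existing delete file, one added delete file. */
  public static List<DeletedRecord> example() {
    List<String> dataFileRows = List.of("a", "b", "c", "d");
    Set<String> existing = Set.of("b");    // committed by an earlier snapshot
    Set<String> added = Set.of("b", "c");  // committed by the scanned snapshot
    return read(dataFileRows, List.of(existing), List.of(added), 1, 3L);
  }

  public static void main(String[] args) {
    example().forEach(System.out::println);
  }
}
```

In this example only row `c` is emitted. Row `b` matches the added delete file as well, but it was already removed by the existing one, so it must not appear in the changelog.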
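default long sizeBytes()
Description copied from interface: ScanTask
The number of bytes that should be read by this scan task.

default int filesCount()
Description copied from interface: ScanTask
The number of files that will be opened by this scan task.
Specified by: filesCount in interface ScanTask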
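
This page says what filesCount() and sizeBytes() report, but not how the defaults are computed. One plausible reading, shown here purely as an assumption, is that a task opens its data file plus every added and existing delete file, and reads the scanned length of the data file plus the full size of each delete file. The class and method signatures below are hypothetical and take plain sizes instead of Iceberg file objects.

```java
import java.util.List;

public class ScanTaskSizingSketch {

  /** Assumed: one data file plus every delete file is opened. */
  public static int filesCount(List<Long> addedDeleteSizes, List<Long> existingDeleteSizes) {
    return 1 + addedDeleteSizes.size() + existingDeleteSizes.size();
  }

  /** Assumed: the scanned data length plus the full size of each delete file is read. */
  public static long sizeBytes(
      long dataLength, List<Long> addedDeleteSizes, List<Long> existingDeleteSizes) {
    long total = dataLength;
    for (long size : addedDeleteSizes) {
      total += size;
    }
    for (long size : existingDeleteSizes) {
      total += size;
    }
    return total;
  }

  public static void main(String[] args) {
    List<Long> added = List.of(10L);
    List<Long> existing = List.of(20L, 30L);
    System.out.println(filesCount(added, existing));     // 1 data file + 3 delete files
    System.out.println(sizeBytes(100L, added, existing)); // 100 + 10 + 20 + 30
  }
}
```

Estimates of this kind are typically used for sizing and grouping scan tasks, so an approximation that counts every delete file in full is usually acceptable.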