Interface AddedRowsScanTask

  • All Superinterfaces:
    ChangelogScanTask, ContentScanTask<DataFile>, ScanTask, java.io.Serializable

    public interface AddedRowsScanTask
    extends ChangelogScanTask, ContentScanTask<DataFile>
    A scan task for inserts generated by adding a data file to the table.

    Note that added data files may have matching delete files. This may happen if a matching position delete file is committed in the same snapshot or if changes for multiple snapshots are squashed together.

    Suppose snapshot S1 adds data files F1, F2, F3 and a position delete file, D1, that marks particular records in F1 as deleted. A scan for changes generated by S1 should include the following tasks:

    • AddedRowsScanTask(file=F1, deletes=[D1], snapshot=S1)
    • AddedRowsScanTask(file=F2, deletes=[], snapshot=S1)
    • AddedRowsScanTask(file=F3, deletes=[], snapshot=S1)

    Readers consuming these tasks should produce added records with metadata like change ordinal and commit snapshot ID.