Interface SplittableScanTask<ThisT>

  • Type Parameters:
    ThisT - the child Java API class
    All Superinterfaces:
    ScanTask, java.io.Serializable
    All Known Subinterfaces:
    DataTask, FileScanTask
    All Known Implementing Classes:
    BaseFileScanTask

    public interface SplittableScanTask<ThisT>
    extends ScanTask
    A scan task that can be split into smaller scan tasks.
    • Method Detail

      • split

        java.lang.Iterable<ThisT> split(long targetSplitSize)
        Attempts to split this scan task into several smaller scan tasks, each close to targetSplitSize bytes in size.

        Note that the target split size is only guidance: the actual size of each split may be smaller or larger. File formats such as Parquet may use row group offset information when splitting tasks.

        Parameters:
        targetSplitSize - the target size of each new scan task in bytes
        Returns:
        an Iterable of smaller tasks
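
The contract of split can be illustrated with a minimal, self-contained sketch. It does not use the Iceberg classes: Splittable, ByteRangeTask, and splitToList are hypothetical names introduced only for this example, and the fixed-size byte-range strategy is an assumption rather than how any Iceberg implementation actually splits. It shows only the shape of the contract: a task returns an Iterable of its own type, and the pieces are close to, but not necessarily equal to, the target size.

```java
import java.util.ArrayList;
import java.util.List;

public class SplitSketch {

  // Hypothetical stand-in for the interface documented above; not the Iceberg type.
  interface Splittable<ThisT> {
    Iterable<ThisT> split(long targetSplitSize);
  }

  // Hypothetical task covering the byte range [start, start + length) of one file.
  static final class ByteRangeTask implements Splittable<ByteRangeTask> {
    final long start;
    final long length;

    ByteRangeTask(long start, long length) {
      this.start = start;
      this.length = length;
    }

    @Override
    public Iterable<ByteRangeTask> split(long targetSplitSize) {
      if (targetSplitSize <= 0) {
        throw new IllegalArgumentException("targetSplitSize must be positive");
      }
      List<ByteRangeTask> parts = new ArrayList<>();
      long end = start + length;
      // Cut fixed-size ranges; the last one holds the remainder and may be smaller.
      for (long offset = start; offset < end; offset += targetSplitSize) {
        parts.add(new ByteRangeTask(offset, Math.min(targetSplitSize, end - offset)));
      }
      return parts;
    }
  }

  // Convenience helper (hypothetical): collects the Iterable into a List.
  static List<ByteRangeTask> splitToList(ByteRangeTask task, long targetSplitSize) {
    List<ByteRangeTask> out = new ArrayList<>();
    task.split(targetSplitSize).forEach(out::add);
    return out;
  }

  public static void main(String[] args) {
    ByteRangeTask task = new ByteRangeTask(0L, 300L);
    for (ByteRangeTask part : task.split(128L)) {
      System.out.println(part.start + " +" + part.length);
    }
  }
}
```

In this sketch a 300-byte task split with a target of 128 yields ranges of 128, 128, and 44 bytes, so the final piece is smaller than the target. A format-aware implementation might instead cut at row group boundaries, which can produce pieces larger than the target as well.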