Interface SplittableScanTask<ThisT>

Type Parameters:
ThisT - the child Java API class; split returns tasks of this type
All Superinterfaces:
ScanTask, Serializable
All Known Subinterfaces:
DataTask, FileScanTask
All Known Implementing Classes:
BaseFileScanTask

public interface SplittableScanTask<ThisT> extends ScanTask
A scan task that can be split into smaller scan tasks.
  • Method Details

    • split

      Iterable<ThisT> split(long targetSplitSize)
      Attempts to split this scan task into several smaller scan tasks, each close to targetSplitSize in size.

      Note that the target split size is only guidance: the actual size of each split may be smaller or larger. File formats such as Parquet may use row group offset information when splitting tasks.

      Parameters:
      targetSplitSize - the target size of each new scan task in bytes
      Returns:
      an Iterable of smaller tasks
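
The split contract above can be illustrated with a minimal, self-contained sketch. It does not use the library's classes: `Splittable` is a hypothetical stand-in that only mirrors the `split` signature, and `ByteRangeTask` is a hypothetical task covering a byte range of one file. The fixed-size chunking shown here is an assumption chosen for simplicity; a real implementation may instead align splits to format-specific boundaries such as row group offsets.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in mirroring the split signature, so the sketch compiles on its own.
interface Splittable<ThisT> {
  Iterable<ThisT> split(long targetSplitSize);
}

// Hypothetical task covering the byte range [start, start + length) of one file.
final class ByteRangeTask implements Splittable<ByteRangeTask> {
  final long start;
  final long length;

  ByteRangeTask(long start, long length) {
    this.start = start;
    this.length = length;
  }

  @Override
  public Iterable<ByteRangeTask> split(long targetSplitSize) {
    if (targetSplitSize <= 0) {
      throw new IllegalArgumentException("targetSplitSize must be positive: " + targetSplitSize);
    }
    List<ByteRangeTask> parts = new ArrayList<>();
    long offset = start;
    long remaining = length;
    // Cut fixed-size chunks; the last chunk holds whatever is left and may be smaller.
    while (remaining > 0) {
      long size = Math.min(targetSplitSize, remaining);
      parts.add(new ByteRangeTask(offset, size));
      offset += size;
      remaining -= size;
    }
    return parts;
  }
}
```

In this sketch, splitting a 250-byte task with a target of 100 bytes yields three tasks of 100, 100, and 50 bytes, which shows why the target size is guidance rather than a guarantee. Binding `ThisT` to the implementing class lets callers receive `ByteRangeTask` instances directly, without a cast.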