Class SparkDataFile

All Implemented Interfaces:
ContentFile<DataFile>, DataFile

public class SparkDataFile extends SparkContentFile<DataFile> implements DataFile
  • Constructor Details

    • SparkDataFile

      public SparkDataFile(Types.StructType type, org.apache.spark.sql.types.StructType sparkType)
    • SparkDataFile

      public SparkDataFile(Types.StructType type, Types.StructType projectedType, org.apache.spark.sql.types.StructType sparkType)
  • Method Details

    • asFile

      protected DataFile asFile()
      Specified by:
      asFile in class SparkContentFile<DataFile>
    • equalityFieldIds

      public List<Integer> equalityFieldIds()
      Description copied from interface: ContentFile
      Returns the set of field IDs used for equality comparison, in equality delete files.

      An equality delete file may contain additional data fields that are not used by equality comparison. The subset of columns in a delete file to be used in equality comparison are tracked by ID. Extra columns can be used to reconstruct changes and metrics from extra columns are used during job planning.

      Specified by:
      equalityFieldIds in interface ContentFile<DataFile>
      Specified by:
      equalityFieldIds in interface DataFile
      Overrides:
      equalityFieldIds in class SparkContentFile<DataFile>
      Returns:
      IDs of the fields used in equality comparison with the records in this delete file