Package org.apache.iceberg.spark
Class SparkReadConf
- java.lang.Object
- 
- org.apache.iceberg.spark.SparkReadConf
 
- 
 public class SparkReadConf extends java.lang.ObjectA class for common Iceberg configs for Spark reads.If a config is set at multiple levels, the following order of precedence is used (top to bottom): - Read options
- Session configuration
- Table metadata
 Note this class is NOT meant to be serialized and sent to executors. 
- 
- 
Constructor SummaryConstructors Constructor Description SparkReadConf(org.apache.spark.sql.SparkSession spark, Table table, java.util.Map<java.lang.String,java.lang.String> readOptions)
 - 
Method SummaryAll Methods Instance Methods Concrete Methods Modifier and Type Method Description java.lang.LongasOfTimestamp()booleancaseSensitive()java.lang.LongendSnapshotId()java.lang.StringfileScanTaskSetId()booleanhandleTimestampWithoutZone()Enables reading a timestamp without time zone as a timestamp with time zone.booleanlocalityEnabled()intorcBatchSize()booleanorcVectorizationEnabled()intparquetBatchSize()booleanparquetVectorizationEnabled()java.lang.LongsnapshotId()intsplitLookback()java.lang.IntegersplitLookbackOption()longsplitOpenFileCost()java.lang.LongsplitOpenFileCostOption()longsplitSize()java.lang.LongsplitSizeOption()java.lang.LongstartSnapshotId()java.lang.LongstreamFromTimestamp()booleanstreamingSkipDeleteSnapshots()booleanstreamingSkipOverwriteSnapshots()
 
- 
- 
- 
Constructor Detail- 
SparkReadConfpublic SparkReadConf(org.apache.spark.sql.SparkSession spark, Table table, java.util.Map<java.lang.String,java.lang.String> readOptions)
 
- 
 - 
Method Detail- 
caseSensitivepublic boolean caseSensitive() 
 - 
localityEnabledpublic boolean localityEnabled() 
 - 
snapshotIdpublic java.lang.Long snapshotId() 
 - 
asOfTimestamppublic java.lang.Long asOfTimestamp() 
 - 
startSnapshotIdpublic java.lang.Long startSnapshotId() 
 - 
endSnapshotIdpublic java.lang.Long endSnapshotId() 
 - 
fileScanTaskSetIdpublic java.lang.String fileScanTaskSetId() 
 - 
streamingSkipDeleteSnapshotspublic boolean streamingSkipDeleteSnapshots() 
 - 
streamingSkipOverwriteSnapshotspublic boolean streamingSkipOverwriteSnapshots() 
 - 
parquetVectorizationEnabledpublic boolean parquetVectorizationEnabled() 
 - 
parquetBatchSizepublic int parquetBatchSize() 
 - 
orcVectorizationEnabledpublic boolean orcVectorizationEnabled() 
 - 
orcBatchSizepublic int orcBatchSize() 
 - 
splitSizeOptionpublic java.lang.Long splitSizeOption() 
 - 
splitSizepublic long splitSize() 
 - 
splitLookbackOptionpublic java.lang.Integer splitLookbackOption() 
 - 
splitLookbackpublic int splitLookback() 
 - 
splitOpenFileCostOptionpublic java.lang.Long splitOpenFileCostOption() 
 - 
splitOpenFileCostpublic long splitOpenFileCost() 
 - 
handleTimestampWithoutZonepublic boolean handleTimestampWithoutZone() Enables reading a timestamp without time zone as a timestamp with time zone.Generally, this is not safe as a timestamp without time zone is supposed to represent the wall-clock time, i.e. no matter the reader/writer timezone 3PM should always be read as 3PM, but a timestamp with time zone represents instant semantics, i.e. the timestamp is adjusted so that the corresponding time in the reader timezone is displayed. When set to false (default), an exception must be thrown while reading a timestamp without time zone. - Returns:
- boolean indicating if reading timestamps without timezone is allowed
 
 - 
streamFromTimestamppublic java.lang.Long streamFromTimestamp() 
 
- 
 
-