public class SparkReadConf
extends java.lang.Object
If a config is set at multiple levels, the following order of precedence is used (top to bottom):
Note this class is NOT meant to be serialized and sent to executors.
Constructor and Description |
---|
SparkReadConf(org.apache.spark.sql.SparkSession spark,
Table table,
java.util.Map<java.lang.String,java.lang.String> readOptions) |
SparkReadConf(org.apache.spark.sql.SparkSession spark,
Table table,
java.lang.String branch,
java.util.Map<java.lang.String,java.lang.String> readOptions) |
Modifier and Type | Method and Description |
---|---|
boolean |
aggregatePushDownEnabled() |
java.lang.Long |
asOfTimestamp() |
java.lang.String |
branch() |
boolean |
caseSensitive() |
java.lang.Long |
endSnapshotId() |
java.lang.Long |
endTimestamp() |
java.lang.String |
fileScanTaskSetId()
Deprecated.
will be removed in 1.3.0, use
scanTaskSetId() instead |
boolean |
handleTimestampWithoutZone()
Enables reading a timestamp without time zone as a timestamp with time zone.
|
boolean |
localityEnabled() |
int |
orcBatchSize() |
boolean |
orcVectorizationEnabled() |
int |
parquetBatchSize() |
boolean |
parquetVectorizationEnabled() |
boolean |
preserveDataGrouping() |
java.lang.String |
scanTaskSetId() |
java.lang.Long |
snapshotId() |
int |
splitLookback() |
java.lang.Integer |
splitLookbackOption() |
long |
splitOpenFileCost() |
java.lang.Long |
splitOpenFileCostOption() |
long |
splitSize() |
java.lang.Long |
splitSizeOption() |
java.lang.Long |
startSnapshotId() |
java.lang.Long |
startTimestamp() |
java.lang.Long |
streamFromTimestamp() |
boolean |
streamingSkipDeleteSnapshots() |
boolean |
streamingSkipOverwriteSnapshots() |
java.lang.String |
tag() |
public SparkReadConf(org.apache.spark.sql.SparkSession spark, Table table, java.util.Map<java.lang.String,java.lang.String> readOptions)
public SparkReadConf(org.apache.spark.sql.SparkSession spark, Table table, java.lang.String branch, java.util.Map<java.lang.String,java.lang.String> readOptions)
public boolean caseSensitive()
public boolean localityEnabled()
public java.lang.Long snapshotId()
public java.lang.Long asOfTimestamp()
public java.lang.Long startSnapshotId()
public java.lang.Long endSnapshotId()
public java.lang.String branch()
public java.lang.String tag()
@Deprecated public java.lang.String fileScanTaskSetId()
scanTaskSetId()
insteadpublic java.lang.String scanTaskSetId()
public boolean streamingSkipDeleteSnapshots()
public boolean streamingSkipOverwriteSnapshots()
public boolean parquetVectorizationEnabled()
public int parquetBatchSize()
public boolean orcVectorizationEnabled()
public int orcBatchSize()
public java.lang.Long splitSizeOption()
public long splitSize()
public java.lang.Integer splitLookbackOption()
public int splitLookback()
public java.lang.Long splitOpenFileCostOption()
public long splitOpenFileCost()
public boolean handleTimestampWithoutZone()
Generally, this is not safe as a timestamp without time zone is supposed to represent the wall-clock time, i.e. no matter the reader/writer timezone 3PM should always be read as 3PM, but a timestamp with time zone represents instant semantics, i.e. the timestamp is adjusted so that the corresponding time in the reader timezone is displayed.
When set to false (default), an exception must be thrown while reading a timestamp without time zone.
public java.lang.Long streamFromTimestamp()
public java.lang.Long startTimestamp()
public java.lang.Long endTimestamp()
public boolean preserveDataGrouping()
public boolean aggregatePushDownEnabled()