Package org.apache.iceberg.spark
Class SparkCatalog
java.lang.Object
org.apache.iceberg.spark.SparkCatalog
- All Implemented Interfaces:
- HasIcebergCatalog,- SupportsReplaceView,- org.apache.spark.sql.connector.catalog.CatalogPlugin,- org.apache.spark.sql.connector.catalog.FunctionCatalog,- org.apache.spark.sql.connector.catalog.StagingTableCatalog,- org.apache.spark.sql.connector.catalog.SupportsNamespaces,- org.apache.spark.sql.connector.catalog.TableCatalog,- org.apache.spark.sql.connector.catalog.ViewCatalog,- ProcedureCatalog
A Spark TableCatalog implementation that wraps an Iceberg 
Catalog.
 This supports the following catalog configuration options:
- type- catalog type, "hive" or "hadoop" or "rest". To specify a non-hive or hadoop catalog, use the- catalog-imploption.
- uri- the Hive Metastore URI for Hive catalog or REST URI for REST catalog
- warehouse- the warehouse path (Hadoop catalog only)
- catalog-impl- a custom- Catalogimplementation to use
- io-impl- a custom- FileIOimplementation to use
- metrics-reporter-impl- a custom- MetricsReporterimplementation to use
- default-namespace- a namespace to use as the default
- cache-enabled- whether to enable catalog cache
- cache.case-sensitive- whether the catalog cache should compare table identifiers in a case sensitive way
- cache.expiration-interval-ms- interval in millis before expiring tables from catalog cache. Refer to- CatalogProperties.CACHE_EXPIRATION_INTERVAL_MSfor further details and significant values.
- table-default.$tablePropertyKey- table property $tablePropertyKey default at catalog level
- table-override.$tablePropertyKey- table property $tablePropertyKey enforced at catalog level
- 
Field SummaryFields inherited from interface org.apache.spark.sql.connector.catalog.SupportsNamespacesPROP_COMMENT, PROP_LOCATION, PROP_OWNERFields inherited from interface org.apache.spark.sql.connector.catalog.TableCatalogOPTION_PREFIX, PROP_COMMENT, PROP_EXTERNAL, PROP_IS_MANAGED_LOCATION, PROP_LOCATION, PROP_OWNER, PROP_PROVIDERFields inherited from interface org.apache.spark.sql.connector.catalog.ViewCatalogPROP_COMMENT, PROP_CREATE_ENGINE_VERSION, PROP_ENGINE_VERSION, PROP_OWNER, RESERVED_PROPERTIES
- 
Constructor SummaryConstructors
- 
Method SummaryModifier and TypeMethodDescriptionvoidalterNamespace(String[] namespace, org.apache.spark.sql.connector.catalog.NamespaceChange... changes) org.apache.spark.sql.connector.catalog.TablealterTable(org.apache.spark.sql.connector.catalog.Identifier ident, org.apache.spark.sql.connector.catalog.TableChange... changes) org.apache.spark.sql.connector.catalog.ViewalterView(org.apache.spark.sql.connector.catalog.Identifier ident, org.apache.spark.sql.connector.catalog.ViewChange... changes) protected CatalogbuildIcebergCatalog(String name, org.apache.spark.sql.util.CaseInsensitiveStringMap options) Build an IcebergCatalogto be used by this Spark catalog adapter.protected TableIdentifierbuildIdentifier(org.apache.spark.sql.connector.catalog.Identifier identifier) Build an IcebergTableIdentifierfor the given Spark identifier.voidcreateNamespace(String[] namespace, Map<String, String> metadata) org.apache.spark.sql.connector.catalog.TablecreateTable(org.apache.spark.sql.connector.catalog.Identifier ident, org.apache.spark.sql.types.StructType schema, org.apache.spark.sql.connector.expressions.Transform[] transforms, Map<String, String> properties) org.apache.spark.sql.connector.catalog.ViewcreateView(org.apache.spark.sql.connector.catalog.Identifier ident, String sql, String currentCatalog, String[] currentNamespace, org.apache.spark.sql.types.StructType schema, String[] queryColumnNames, String[] columnAliases, String[] columnComments, Map<String, String> properties) String[]booleandropNamespace(String[] namespace, boolean cascade) booleandropTable(org.apache.spark.sql.connector.catalog.Identifier ident) booleandropView(org.apache.spark.sql.connector.catalog.Identifier ident) Returns the underlyingCatalogbacking this Spark Catalogfinal voidinitialize(String name, org.apache.spark.sql.util.CaseInsensitiveStringMap options) voidinvalidateTable(org.apache.spark.sql.connector.catalog.Identifier ident) booleanisExistingNamespace(String[] namespace) booleanisFunctionNamespace(String[] namespace) default org.apache.spark.sql.connector.catalog.Identifier[]listFunctions(String[] namespace) String[][]String[][]listNamespaces(String[] namespace) org.apache.spark.sql.connector.catalog.Identifier[]listTables(String[] namespace) org.apache.spark.sql.connector.catalog.Identifier[]default org.apache.spark.sql.connector.catalog.functions.UnboundFunctionloadFunction(org.apache.spark.sql.connector.catalog.Identifier ident) loadNamespaceMetadata(String[] namespace) loadProcedure(org.apache.spark.sql.connector.catalog.Identifier ident) Load astored procedurebyidentifier.org.apache.spark.sql.connector.catalog.TableloadTable(org.apache.spark.sql.connector.catalog.Identifier ident) org.apache.spark.sql.connector.catalog.TableloadTable(org.apache.spark.sql.connector.catalog.Identifier ident, long timestamp) org.apache.spark.sql.connector.catalog.Tableorg.apache.spark.sql.connector.catalog.ViewloadView(org.apache.spark.sql.connector.catalog.Identifier ident) name()booleanpurgeTable(org.apache.spark.sql.connector.catalog.Identifier ident) voidrenameTable(org.apache.spark.sql.connector.catalog.Identifier from, org.apache.spark.sql.connector.catalog.Identifier to) voidrenameView(org.apache.spark.sql.connector.catalog.Identifier fromIdentifier, org.apache.spark.sql.connector.catalog.Identifier toIdentifier) org.apache.spark.sql.connector.catalog.ViewreplaceView(org.apache.spark.sql.connector.catalog.Identifier ident, String sql, String currentCatalog, String[] currentNamespace, org.apache.spark.sql.types.StructType schema, String[] queryColumnNames, String[] columnAliases, String[] columnComments, Map<String, String> properties) Replace a view in the catalogorg.apache.spark.sql.connector.catalog.StagedTablestageCreate(org.apache.spark.sql.connector.catalog.Identifier ident, org.apache.spark.sql.types.StructType schema, org.apache.spark.sql.connector.expressions.Transform[] transforms, Map<String, String> properties) org.apache.spark.sql.connector.catalog.StagedTablestageCreateOrReplace(org.apache.spark.sql.connector.catalog.Identifier ident, org.apache.spark.sql.types.StructType schema, org.apache.spark.sql.connector.expressions.Transform[] transforms, Map<String, String> properties) org.apache.spark.sql.connector.catalog.StagedTablestageReplace(org.apache.spark.sql.connector.catalog.Identifier ident, org.apache.spark.sql.types.StructType schema, org.apache.spark.sql.connector.expressions.Transform[] transforms, Map<String, String> properties) booleanMethods inherited from class java.lang.Objectclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface org.apache.spark.sql.connector.catalog.FunctionCatalogfunctionExistsMethods inherited from interface org.apache.spark.sql.connector.catalog.StagingTableCatalogstageCreate, stageCreateOrReplace, stageReplaceMethods inherited from interface org.apache.spark.sql.connector.catalog.SupportsNamespacesnamespaceExistsMethods inherited from interface org.apache.spark.sql.connector.catalog.TableCatalogcapabilities, createTable, loadTable, tableExistsMethods inherited from interface org.apache.spark.sql.connector.catalog.ViewCataloginvalidateView, viewExists
- 
Constructor Details- 
SparkCatalogpublic SparkCatalog()
 
- 
- 
Method Details- 
buildIcebergCatalogprotected Catalog buildIcebergCatalog(String name, org.apache.spark.sql.util.CaseInsensitiveStringMap options) Build an IcebergCatalogto be used by this Spark catalog adapter.- Parameters:
- name- Spark's catalog name
- options- Spark's catalog options
- Returns:
- an Iceberg catalog
 
- 
buildIdentifierprotected TableIdentifier buildIdentifier(org.apache.spark.sql.connector.catalog.Identifier identifier) Build an IcebergTableIdentifierfor the given Spark identifier.- Parameters:
- identifier- Spark's identifier
- Returns:
- an Iceberg identifier
 
- 
loadTablepublic org.apache.spark.sql.connector.catalog.Table loadTable(org.apache.spark.sql.connector.catalog.Identifier ident) throws org.apache.spark.sql.catalyst.analysis.NoSuchTableException - Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchTableException
 
- 
loadTablepublic org.apache.spark.sql.connector.catalog.Table loadTable(org.apache.spark.sql.connector.catalog.Identifier ident, String version) throws org.apache.spark.sql.catalyst.analysis.NoSuchTableException - Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchTableException
 
- 
loadTablepublic org.apache.spark.sql.connector.catalog.Table loadTable(org.apache.spark.sql.connector.catalog.Identifier ident, long timestamp) throws org.apache.spark.sql.catalyst.analysis.NoSuchTableException - Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchTableException
 
- 
createTablepublic org.apache.spark.sql.connector.catalog.Table createTable(org.apache.spark.sql.connector.catalog.Identifier ident, org.apache.spark.sql.types.StructType schema, org.apache.spark.sql.connector.expressions.Transform[] transforms, Map<String, String> properties) throws org.apache.spark.sql.catalyst.analysis.TableAlreadyExistsException- Throws:
- org.apache.spark.sql.catalyst.analysis.TableAlreadyExistsException
 
- 
stageCreatepublic org.apache.spark.sql.connector.catalog.StagedTable stageCreate(org.apache.spark.sql.connector.catalog.Identifier ident, org.apache.spark.sql.types.StructType schema, org.apache.spark.sql.connector.expressions.Transform[] transforms, Map<String, String> properties) throws org.apache.spark.sql.catalyst.analysis.TableAlreadyExistsException- Throws:
- org.apache.spark.sql.catalyst.analysis.TableAlreadyExistsException
 
- 
stageReplacepublic org.apache.spark.sql.connector.catalog.StagedTable stageReplace(org.apache.spark.sql.connector.catalog.Identifier ident, org.apache.spark.sql.types.StructType schema, org.apache.spark.sql.connector.expressions.Transform[] transforms, Map<String, String> properties) throws org.apache.spark.sql.catalyst.analysis.NoSuchTableException- Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchTableException
 
- 
stageCreateOrReplace
- 
alterTablepublic org.apache.spark.sql.connector.catalog.Table alterTable(org.apache.spark.sql.connector.catalog.Identifier ident, org.apache.spark.sql.connector.catalog.TableChange... changes) throws org.apache.spark.sql.catalyst.analysis.NoSuchTableException - Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchTableException
 
- 
dropTablepublic boolean dropTable(org.apache.spark.sql.connector.catalog.Identifier ident) 
- 
purgeTablepublic boolean purgeTable(org.apache.spark.sql.connector.catalog.Identifier ident) 
- 
renameTablepublic void renameTable(org.apache.spark.sql.connector.catalog.Identifier from, org.apache.spark.sql.connector.catalog.Identifier to) throws org.apache.spark.sql.catalyst.analysis.NoSuchTableException, org.apache.spark.sql.catalyst.analysis.TableAlreadyExistsException - Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchTableException
- org.apache.spark.sql.catalyst.analysis.TableAlreadyExistsException
 
- 
invalidateTablepublic void invalidateTable(org.apache.spark.sql.connector.catalog.Identifier ident) 
- 
listTables
- 
defaultNamespace
- 
listNamespaces
- 
listNamespacespublic String[][] listNamespaces(String[] namespace) throws org.apache.spark.sql.catalyst.analysis.NoSuchNamespaceException - Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchNamespaceException
 
- 
loadNamespaceMetadatapublic Map<String,String> loadNamespaceMetadata(String[] namespace) throws org.apache.spark.sql.catalyst.analysis.NoSuchNamespaceException - Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchNamespaceException
 
- 
createNamespacepublic void createNamespace(String[] namespace, Map<String, String> metadata) throws org.apache.spark.sql.catalyst.analysis.NamespaceAlreadyExistsException- Throws:
- org.apache.spark.sql.catalyst.analysis.NamespaceAlreadyExistsException
 
- 
alterNamespacepublic void alterNamespace(String[] namespace, org.apache.spark.sql.connector.catalog.NamespaceChange... changes) throws org.apache.spark.sql.catalyst.analysis.NoSuchNamespaceException - Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchNamespaceException
 
- 
dropNamespacepublic boolean dropNamespace(String[] namespace, boolean cascade) throws org.apache.spark.sql.catalyst.analysis.NoSuchNamespaceException - Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchNamespaceException
 
- 
listViews
- 
loadViewpublic org.apache.spark.sql.connector.catalog.View loadView(org.apache.spark.sql.connector.catalog.Identifier ident) throws org.apache.spark.sql.catalyst.analysis.NoSuchViewException - Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchViewException
 
- 
createViewpublic org.apache.spark.sql.connector.catalog.View createView(org.apache.spark.sql.connector.catalog.Identifier ident, String sql, String currentCatalog, String[] currentNamespace, org.apache.spark.sql.types.StructType schema, String[] queryColumnNames, String[] columnAliases, String[] columnComments, Map<String, String> properties) throws org.apache.spark.sql.catalyst.analysis.ViewAlreadyExistsException, org.apache.spark.sql.catalyst.analysis.NoSuchNamespaceException- Throws:
- org.apache.spark.sql.catalyst.analysis.ViewAlreadyExistsException
- org.apache.spark.sql.catalyst.analysis.NoSuchNamespaceException
 
- 
replaceViewpublic org.apache.spark.sql.connector.catalog.View replaceView(org.apache.spark.sql.connector.catalog.Identifier ident, String sql, String currentCatalog, String[] currentNamespace, org.apache.spark.sql.types.StructType schema, String[] queryColumnNames, String[] columnAliases, String[] columnComments, Map<String, String> properties) throws org.apache.spark.sql.catalyst.analysis.NoSuchNamespaceException, org.apache.spark.sql.catalyst.analysis.NoSuchViewExceptionDescription copied from interface:SupportsReplaceViewReplace a view in the catalog- Parameters:
- ident- a view identifier
- sql- the SQL text that defines the view
- currentCatalog- the current catalog
- currentNamespace- the current namespace
- schema- the view query output schema
- queryColumnNames- the query column names
- columnAliases- the column aliases
- columnComments- the column comments
- properties- the view properties
- Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchNamespaceException- If the identifier namespace does not exist (optional)
- org.apache.spark.sql.catalyst.analysis.NoSuchViewException- If the view doesn't exist or is a table
 
- 
alterViewpublic org.apache.spark.sql.connector.catalog.View alterView(org.apache.spark.sql.connector.catalog.Identifier ident, org.apache.spark.sql.connector.catalog.ViewChange... changes) throws org.apache.spark.sql.catalyst.analysis.NoSuchViewException, IllegalArgumentException - Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchViewException
- IllegalArgumentException
 
- 
dropViewpublic boolean dropView(org.apache.spark.sql.connector.catalog.Identifier ident) 
- 
renameViewpublic void renameView(org.apache.spark.sql.connector.catalog.Identifier fromIdentifier, org.apache.spark.sql.connector.catalog.Identifier toIdentifier) throws org.apache.spark.sql.catalyst.analysis.NoSuchViewException, org.apache.spark.sql.catalyst.analysis.ViewAlreadyExistsException - Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchViewException
- org.apache.spark.sql.catalyst.analysis.ViewAlreadyExistsException
 
- 
initializepublic final void initialize(String name, org.apache.spark.sql.util.CaseInsensitiveStringMap options) - Specified by:
- initializein interface- org.apache.spark.sql.connector.catalog.CatalogPlugin
 
- 
name
- 
icebergCatalogDescription copied from interface:HasIcebergCatalogReturns the underlyingCatalogbacking this Spark Catalog
- 
loadProcedurepublic Procedure loadProcedure(org.apache.spark.sql.connector.catalog.Identifier ident) throws NoSuchProcedureException Description copied from interface:ProcedureCatalogLoad astored procedurebyidentifier.- Specified by:
- loadProcedurein interface- ProcedureCatalog
- Parameters:
- ident- a stored procedure identifier
- Returns:
- the stored procedure's metadata
- Throws:
- NoSuchProcedureException- if there is no matching stored procedure
 
- 
isFunctionNamespace
- 
isExistingNamespace
- 
useNullableQuerySchemapublic boolean useNullableQuerySchema()- Specified by:
- useNullableQuerySchemain interface- org.apache.spark.sql.connector.catalog.TableCatalog
 
- 
listFunctionsdefault org.apache.spark.sql.connector.catalog.Identifier[] listFunctions(String[] namespace) throws org.apache.spark.sql.catalyst.analysis.NoSuchNamespaceException - Specified by:
- listFunctionsin interface- org.apache.spark.sql.connector.catalog.FunctionCatalog
- Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchNamespaceException
 
- 
loadFunctiondefault org.apache.spark.sql.connector.catalog.functions.UnboundFunction loadFunction(org.apache.spark.sql.connector.catalog.Identifier ident) throws org.apache.spark.sql.catalyst.analysis.NoSuchFunctionException - Specified by:
- loadFunctionin interface- org.apache.spark.sql.connector.catalog.FunctionCatalog
- Throws:
- org.apache.spark.sql.catalyst.analysis.NoSuchFunctionException
 
 
-