Class Schema
- java.lang.Object
- 
- org.apache.iceberg.Schema
 
- 
- All Implemented Interfaces:
- java.io.Serializable
 
 public class Schema extends java.lang.Object implements java.io.SerializableThe schema of a data table.Schema ID will only be populated when reading from/writing to table metadata, otherwise it will be default to 0. - See Also:
- Serialized Form
 
- 
- 
Constructor SummaryConstructors Constructor Description Schema(int schemaId, java.util.List<Types.NestedField> columns)Schema(int schemaId, java.util.List<Types.NestedField> columns, java.util.Map<java.lang.String,java.lang.Integer> aliases, java.util.Set<java.lang.Integer> identifierFieldIds)Schema(int schemaId, java.util.List<Types.NestedField> columns, java.util.Set<java.lang.Integer> identifierFieldIds)Schema(int schemaId, Types.NestedField... columns)Schema(java.util.List<Types.NestedField> columns)Schema(java.util.List<Types.NestedField> columns, java.util.Map<java.lang.String,java.lang.Integer> aliases)Schema(java.util.List<Types.NestedField> columns, java.util.Map<java.lang.String,java.lang.Integer> aliases, java.util.Set<java.lang.Integer> identifierFieldIds)Schema(java.util.List<Types.NestedField> columns, java.util.Set<java.lang.Integer> identifierFieldIds)Schema(Types.NestedField... columns)
 - 
Method SummaryAll Methods Instance Methods Concrete Methods Modifier and Type Method Description Accessor<StructLike>accessorForField(int id)Returns an accessor for retrieving the data fromStructLike.java.lang.IntegeraliasToId(java.lang.String alias)Returns the column id for the given column alias.Types.StructTypeasStruct()Returns the underlyingstruct typefor this schema.Types.NestedFieldcaseInsensitiveFindField(java.lang.String name)Returns a sub-field by name as aTypes.NestedField.SchemacaseInsensitiveSelect(java.lang.String... names)Creates a projection schema for a subset of columns, selected by case insensitive namesSchemacaseInsensitiveSelect(java.util.Collection<java.lang.String> names)Creates a projection schema for a subset of columns, selected by case insensitive namesjava.util.List<Types.NestedField>columns()Returns a List of thecolumnsin this Schema.java.lang.StringfindColumnName(int id)Returns the full column name for the given id.Types.NestedFieldfindField(int id)Returns the sub-field identified by the field id as aTypes.NestedField.Types.NestedFieldfindField(java.lang.String name)Returns a sub-field by name as aTypes.NestedField.TypefindType(int id)Returns theTypeof a sub-field identified by the field id.TypefindType(java.lang.String name)Returns theTypeof a sub-field identified by the field name.java.util.Map<java.lang.String,java.lang.Integer>getAliases()Returns an alias map for this schema, if set.inthighestFieldId()Returns the highest field ID in this schema, including nested fields.java.util.Set<java.lang.Integer>identifierFieldIds()The set of identifier field IDs.java.util.Set<java.lang.String>identifierFieldNames()Returns the set of identifier field names.java.lang.StringidToAlias(java.lang.Integer fieldId)Returns the full column name in the unconverted data schema for the given column id.java.util.Map<java.lang.Integer,java.lang.String>idToName()Returns a map for this schema between field id and qualified field names.booleansameSchema(Schema anotherSchema)Checks whether this schema is equivalent to another schema while ignoring the schema ID.intschemaId()Returns the schema ID for this schema.Schemaselect(java.lang.String... names)Creates a projection schema for a subset of columns, selected by name.Schemaselect(java.util.Collection<java.lang.String> names)Creates a projection schema for a subset of columns, selected by name.java.lang.StringtoString()
 
- 
- 
- 
Constructor Detail- 
Schemapublic Schema(java.util.List<Types.NestedField> columns, java.util.Map<java.lang.String,java.lang.Integer> aliases) 
 - 
Schemapublic Schema(java.util.List<Types.NestedField> columns, java.util.Map<java.lang.String,java.lang.Integer> aliases, java.util.Set<java.lang.Integer> identifierFieldIds) 
 - 
Schemapublic Schema(java.util.List<Types.NestedField> columns) 
 - 
Schemapublic Schema(java.util.List<Types.NestedField> columns, java.util.Set<java.lang.Integer> identifierFieldIds) 
 - 
Schemapublic Schema(int schemaId, java.util.List<Types.NestedField> columns)
 - 
Schemapublic Schema(int schemaId, java.util.List<Types.NestedField> columns, java.util.Set<java.lang.Integer> identifierFieldIds)
 - 
Schemapublic Schema(int schemaId, java.util.List<Types.NestedField> columns, java.util.Map<java.lang.String,java.lang.Integer> aliases, java.util.Set<java.lang.Integer> identifierFieldIds)
 - 
Schemapublic Schema(Types.NestedField... columns) 
 - 
Schemapublic Schema(int schemaId, Types.NestedField... columns)
 
- 
 - 
Method Detail- 
schemaIdpublic int schemaId() Returns the schema ID for this schema.Note that schema ID will only be populated when reading from/writing to table metadata, otherwise it will be default to 0. 
 - 
highestFieldIdpublic int highestFieldId() Returns the highest field ID in this schema, including nested fields.
 - 
getAliasespublic java.util.Map<java.lang.String,java.lang.Integer> getAliases() Returns an alias map for this schema, if set.Alias maps are created when translating an external schema, like an Avro Schema, to this format. The original column names can be provided in a Map when constructing this Schema. - Returns:
- a Map of column aliases to field ids
 
 - 
idToNamepublic java.util.Map<java.lang.Integer,java.lang.String> idToName() Returns a map for this schema between field id and qualified field names.- Returns:
- a map of field id to qualified field names
 
 - 
asStructpublic Types.StructType asStruct() Returns the underlyingstruct typefor this schema.- Returns:
- the StructType version of this schema.
 
 - 
columnspublic java.util.List<Types.NestedField> columns() Returns a List of thecolumnsin this Schema.
 - 
identifierFieldIdspublic java.util.Set<java.lang.Integer> identifierFieldIds() The set of identifier field IDs.Identifier is a concept similar to primary key in a relational database system. It consists of a unique set of primitive fields in the schema. An identifier field must be at root, or nested in a chain of structs (no maps or lists). A row should be unique in a table based on the values of the identifier fields. Optional, float and double columns cannot be used as identifier fields. However, Iceberg identifier differs from primary key in the following ways: - Iceberg does not enforce the uniqueness of a row based on this identifier information. It is used for operations like upsert to define the default upsert key.
- A nested field in a struct can be used as an identifier. For example, if there is a "last_name" field inside a "user" struct in a schema, field "user.last_name" can be set as a part of the identifier field.
 - Returns:
- the set of identifier field IDs in this schema.
 
 - 
identifierFieldNamespublic java.util.Set<java.lang.String> identifierFieldNames() Returns the set of identifier field names.
 - 
findTypepublic Type findType(java.lang.String name) Returns theTypeof a sub-field identified by the field name.- Parameters:
- name- a field name
- Returns:
- a Type for the sub-field or null if it is not found
 
 - 
findTypepublic Type findType(int id) Returns theTypeof a sub-field identified by the field id.- Parameters:
- id- a field id
- Returns:
- a Type for the sub-field or null if it is not found
 
 - 
findFieldpublic Types.NestedField findField(int id) Returns the sub-field identified by the field id as aTypes.NestedField.- Parameters:
- id- a field id
- Returns:
- the sub-field or null if it is not found
 
 - 
findFieldpublic Types.NestedField findField(java.lang.String name) Returns a sub-field by name as aTypes.NestedField.The result may be a top-level or a nested field. - Parameters:
- name- a String name
- Returns:
- a Type for the sub-field or null if it is not found
 
 - 
caseInsensitiveFindFieldpublic Types.NestedField caseInsensitiveFindField(java.lang.String name) Returns a sub-field by name as aTypes.NestedField.The result may be a top-level or a nested field. - Parameters:
- name- a String name
- Returns:
- the sub-field or null if it is not found
 
 - 
findColumnNamepublic java.lang.String findColumnName(int id) Returns the full column name for the given id.- Parameters:
- id- a field id
- Returns:
- the full column name in this schema that resolves to the id
 
 - 
aliasToIdpublic java.lang.Integer aliasToId(java.lang.String alias) Returns the column id for the given column alias. Column aliases are set by conversions from Parquet or Avro to this Schema type.- Parameters:
- alias- a full column name in the unconverted data schema
- Returns:
- the column id in this schema, or null if the column wasn't found
 
 - 
idToAliaspublic java.lang.String idToAlias(java.lang.Integer fieldId) Returns the full column name in the unconverted data schema for the given column id. Column aliases are set by conversions from Parquet or Avro to this Schema type.- Parameters:
- fieldId- a column id in this schema
- Returns:
- the full column name in the unconverted data schema, or null if one wasn't found
 
 - 
accessorForFieldpublic Accessor<StructLike> accessorForField(int id) Returns an accessor for retrieving the data fromStructLike.Accessors do not retrieve data contained in lists or maps. - Parameters:
- id- a column id in this schema
- Returns:
- an Accessorto retrieve values from aStructLikerow
 
 - 
selectpublic Schema select(java.lang.String... names) Creates a projection schema for a subset of columns, selected by name.Names that identify nested fields will select part or all of the field's top-level column. - Parameters:
- names- String names for selected columns
- Returns:
- a projection schema from this schema, by name
 
 - 
selectpublic Schema select(java.util.Collection<java.lang.String> names) Creates a projection schema for a subset of columns, selected by name.Names that identify nested fields will select part or all of the field's top-level column. - Parameters:
- names- a List of String names for selected columns
- Returns:
- a projection schema from this schema, by name
 
 - 
caseInsensitiveSelectpublic Schema caseInsensitiveSelect(java.lang.String... names) Creates a projection schema for a subset of columns, selected by case insensitive namesNames that identify nested fields will select part or all of the field's top-level column. - Parameters:
- names- a List of String names for selected columns
- Returns:
- a projection schema from this schema, by names
 
 - 
caseInsensitiveSelectpublic Schema caseInsensitiveSelect(java.util.Collection<java.lang.String> names) Creates a projection schema for a subset of columns, selected by case insensitive namesNames that identify nested fields will select part or all of the field's top-level column. - Parameters:
- names- a List of String names for selected columns
- Returns:
- a projection schema from this schema, by names
 
 - 
sameSchemapublic boolean sameSchema(Schema anotherSchema) Checks whether this schema is equivalent to another schema while ignoring the schema ID.- Parameters:
- anotherSchema- another schema
- Returns:
- true if this schema is equivalent to the given schema
 
 - 
toStringpublic java.lang.String toString() - Overrides:
- toStringin class- java.lang.Object
 
 
- 
 
-