Package org.apache.iceberg.io
Class ResolvingFileIO
java.lang.Object
org.apache.iceberg.io.ResolvingFileIO
- All Implemented Interfaces:
Closeable
,Serializable
,AutoCloseable
,org.apache.hadoop.conf.Configurable
,HadoopConfigurable
,DelegateFileIO
,FileIO
,SupportsBulkOperations
,SupportsPrefixOperations
FileIO implementation that uses location scheme to choose the correct FileIO implementation.
Delegate FileIO implementations must implement the
DelegateFileIO
mixin interface,
otherwise initialization will fail.- See Also:
-
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionvoid
close()
Close File IO to release underlying resources.void
deleteFile
(String location) Delete the file at the given path.void
deleteFiles
(Iterable<String> pathsToDelete) Delete the files at the given paths.void
deletePrefix
(String prefix) Delete all files under a prefix.protected void
finalize()
org.apache.hadoop.conf.Configuration
getConf()
void
initialize
(Map<String, String> newProperties) Initialize File IO from catalog properties.Class
<?> listPrefix
(String prefix) Return an iterable of all files under a prefix.newInputFile
(String location) Get aInputFile
instance to read bytes from the file at the given path.newInputFile
(String location, long length) Get aInputFile
instance to read bytes from the file at the given path, with a known file length.newOutputFile
(String location) Get aOutputFile
instance to write bytes to the file at the given path.Returns the property map used to configure this FileIOvoid
serializeConfWith
(Function<org.apache.hadoop.conf.Configuration, SerializableSupplier<org.apache.hadoop.conf.Configuration>> confSerializer) Take a function that serializes Hadoop configuration into a supplier.void
setConf
(org.apache.hadoop.conf.Configuration conf) Methods inherited from class java.lang.Object
clone, equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface org.apache.iceberg.io.FileIO
deleteFile, deleteFile, newInputFile, newInputFile, newInputFile
-
Constructor Details
-
ResolvingFileIO
public ResolvingFileIO()No-arg constructor to load the FileIO dynamically.All fields are initialized by calling
initialize(Map)
later.
-
-
Method Details
-
newInputFile
Description copied from interface:FileIO
Get aInputFile
instance to read bytes from the file at the given path.- Specified by:
newInputFile
in interfaceFileIO
-
newInputFile
Description copied from interface:FileIO
Get aInputFile
instance to read bytes from the file at the given path, with a known file length.- Specified by:
newInputFile
in interfaceFileIO
-
newOutputFile
Description copied from interface:FileIO
Get aOutputFile
instance to write bytes to the file at the given path.- Specified by:
newOutputFile
in interfaceFileIO
-
deleteFile
Description copied from interface:FileIO
Delete the file at the given path.- Specified by:
deleteFile
in interfaceFileIO
-
deleteFiles
Description copied from interface:SupportsBulkOperations
Delete the files at the given paths.- Specified by:
deleteFiles
in interfaceSupportsBulkOperations
- Parameters:
pathsToDelete
- The paths to delete- Throws:
BulkDeletionFailureException
- in case of failure to delete at least 1 file
-
properties
Description copied from interface:FileIO
Returns the property map used to configure this FileIO- Specified by:
properties
in interfaceFileIO
-
initialize
Description copied from interface:FileIO
Initialize File IO from catalog properties.- Specified by:
initialize
in interfaceFileIO
- Parameters:
newProperties
- catalog properties
-
close
public void close()Description copied from interface:FileIO
Close File IO to release underlying resources.Calling this method is only required when this FileIO instance is no longer expected to be used, and the resources it holds need to be explicitly released to avoid resource leaks.
-
serializeConfWith
public void serializeConfWith(Function<org.apache.hadoop.conf.Configuration, SerializableSupplier<org.apache.hadoop.conf.Configuration>> confSerializer) Description copied from interface:HadoopConfigurable
Take a function that serializes Hadoop configuration into a supplier. An implementation is supposed to pass in its current Hadoop configuration into this function, and the result can be safely serialized for future use.- Specified by:
serializeConfWith
in interfaceHadoopConfigurable
- Parameters:
confSerializer
- A function that takes Hadoop configuration and returns a serializable supplier of it.
-
setConf
public void setConf(org.apache.hadoop.conf.Configuration conf) - Specified by:
setConf
in interfaceorg.apache.hadoop.conf.Configurable
-
getConf
public org.apache.hadoop.conf.Configuration getConf()- Specified by:
getConf
in interfaceorg.apache.hadoop.conf.Configurable
-
ioClass
-
finalize
-
listPrefix
Description copied from interface:SupportsPrefixOperations
Return an iterable of all files under a prefix.Hierarchical file systems (e.g. HDFS) may impose additional restrictions like the prefix must fully match a directory whereas key/value object stores may allow for arbitrary prefixes.
- Specified by:
listPrefix
in interfaceSupportsPrefixOperations
- Parameters:
prefix
- prefix to list- Returns:
- iterable of file information
-
deletePrefix
Description copied from interface:SupportsPrefixOperations
Delete all files under a prefix.Hierarchical file systems (e.g. HDFS) may impose additional restrictions like the prefix must fully match a directory whereas key/value object stores may allow for arbitrary prefixes.
- Specified by:
deletePrefix
in interfaceSupportsPrefixOperations
- Parameters:
prefix
- prefix to delete
-