Package org.apache.iceberg.hadoop
Class HadoopFileIO
java.lang.Object
org.apache.iceberg.hadoop.HadoopFileIO
- All Implemented Interfaces:
- Closeable, Serializable, AutoCloseable, org.apache.hadoop.conf.Configurable, HadoopConfigurable, DelegateFileIO, FileIO, SupportsBulkOperations, SupportsPrefixOperations
- 
Constructor Summary
- HadoopFileIO(): Constructor used for dynamic FileIO loading.
- HadoopFileIO(org.apache.hadoop.conf.Configuration hadoopConf)
- HadoopFileIO(SerializableSupplier<org.apache.hadoop.conf.Configuration> hadoopConf)
- 
Method Summary
- org.apache.hadoop.conf.Configuration conf()
- void deleteFile(String path): Delete the file at the given path.
- void deleteFiles(Iterable<String> pathsToDelete): Delete the files at the given paths.
- void deletePrefix(String prefix): Delete all files under a prefix.
- org.apache.hadoop.conf.Configuration getConf()
- void initialize(Map<String, String> props): Initialize File IO from catalog properties.
- Iterable<FileInfo> listPrefix(String prefix): Return an iterable of all files under a prefix.
- InputFile newInputFile(String path): Get an InputFile instance to read bytes from the file at the given path.
- InputFile newInputFile(String path, long length): Get an InputFile instance to read bytes from the file at the given path, with a known file length.
- OutputFile newOutputFile(String path): Get an OutputFile instance to write bytes to the file at the given path.
- Map<String, String> properties(): Returns the property map used to configure this FileIO.
- void serializeConfWith(Function<org.apache.hadoop.conf.Configuration, SerializableSupplier<org.apache.hadoop.conf.Configuration>> confSerializer): Take a function that serializes Hadoop configuration into a supplier.
- void setConf(org.apache.hadoop.conf.Configuration conf)
Methods inherited from class java.lang.Object: clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface org.apache.iceberg.io.FileIO: close, deleteFile, deleteFile, newInputFile, newInputFile, newInputFile
- 
Constructor Details
HadoopFileIO
public HadoopFileIO()
Constructor used for dynamic FileIO loading. Hadoop configuration must be set through setConf(Configuration).
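The "construct with no arguments, then configure" pattern that this constructor supports can be sketched without any library on the classpath. Everything named below (`ConfigurableIO`, `SketchIO`, `DynamicLoadSketch.load`) is a hypothetical stand-in, a `Map<String, String>` stands in for the Hadoop `Configuration`, and the failure on a missing configuration is behavior of the sketch only. It is not Iceberg's loading code.

```java
import java.util.Map;

// Hypothetical stand-in for an IO type that is configured after construction.
interface ConfigurableIO {
  void setConf(Map<String, String> conf);      // stand-in for setConf(Configuration)
  void initialize(Map<String, String> props);  // stand-in for initialize(Map)
  String describe();
}

// Hypothetical implementation with a public no-arg constructor.
class SketchIO implements ConfigurableIO {
  private Map<String, String> conf;
  private Map<String, String> props = Map.of();

  public SketchIO() {}

  @Override
  public void setConf(Map<String, String> conf) {
    this.conf = conf;
  }

  @Override
  public void initialize(Map<String, String> props) {
    this.props = Map.copyOf(props);
  }

  @Override
  public String describe() {
    // Sketch-only policy: refuse to work until a configuration was injected.
    if (conf == null) {
      throw new IllegalStateException("configuration must be set through setConf first");
    }
    return "conf=" + conf.size() + ", props=" + props.size();
  }
}

final class DynamicLoadSketch {
  // Instantiate by class name through the no-arg constructor, then inject
  // the configuration and the catalog properties.
  static ConfigurableIO load(
      String className, Map<String, String> conf, Map<String, String> props)
      throws ReflectiveOperationException {
    Object instance = Class.forName(className).getDeclaredConstructor().newInstance();
    ConfigurableIO io = (ConfigurableIO) instance;
    io.setConf(conf);
    io.initialize(props);
    return io;
  }

  public static void main(String[] args) throws ReflectiveOperationException {
    ConfigurableIO io =
        load(SketchIO.class.getName(), Map.of("fs.defaultFS", "file:///"), Map.of());
    System.out.println(io.describe());
  }
}
```

The point of the design is that a loader only needs a class name: the instance becomes usable once the configuration has been supplied separately.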
- 
HadoopFileIO
public HadoopFileIO(org.apache.hadoop.conf.Configuration hadoopConf)
- 
HadoopFileIO
public HadoopFileIO(SerializableSupplier<org.apache.hadoop.conf.Configuration> hadoopConf)
 
- 
- 
Method Details
conf
public org.apache.hadoop.conf.Configuration conf()
- 
initialize
public void initialize(Map<String, String> props)
Description copied from interface: FileIO
Initialize File IO from catalog properties.
- Specified by:
- initialize in interface FileIO
- Parameters:
- props - catalog properties
 
- 
newInputFile
public InputFile newInputFile(String path)
Description copied from interface: FileIO
Get an InputFile instance to read bytes from the file at the given path.
- Specified by:
- newInputFile in interface FileIO
 
- 
newInputFile
public InputFile newInputFile(String path, long length)
Description copied from interface: FileIO
Get an InputFile instance to read bytes from the file at the given path, with a known file length.
- Specified by:
- newInputFile in interface FileIO
 
- 
newOutputFile
public OutputFile newOutputFile(String path)
Description copied from interface: FileIO
Get an OutputFile instance to write bytes to the file at the given path.
- Specified by:
- newOutputFile in interface FileIO
 
- 
deleteFile
public void deleteFile(String path)
Description copied from interface: FileIO
Delete the file at the given path.
- Specified by:
- deleteFile in interface FileIO
 
- 
properties
public Map<String, String> properties()
Description copied from interface: FileIO
Returns the property map used to configure this FileIO.
- Specified by:
- properties in interface FileIO
 
- 
setConf
public void setConf(org.apache.hadoop.conf.Configuration conf)
- Specified by:
- setConf in interface org.apache.hadoop.conf.Configurable
 
- 
getConf
public org.apache.hadoop.conf.Configuration getConf()
- Specified by:
- getConf in interface org.apache.hadoop.conf.Configurable
 
- 
serializeConfWith
public void serializeConfWith(Function<org.apache.hadoop.conf.Configuration, SerializableSupplier<org.apache.hadoop.conf.Configuration>> confSerializer)
Description copied from interface: HadoopConfigurable
Take a function that serializes Hadoop configuration into a supplier. An implementation is expected to pass its current Hadoop configuration into this function; the resulting supplier can then be safely serialized for future use.
- Specified by:
- serializeConfWith in interface HadoopConfigurable
- Parameters:
- confSerializer - a function that takes a Hadoop configuration and returns a serializable supplier of it
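The contract described above (hand the current configuration to a function, keep the serializable supplier it returns) can be illustrated with a library-free sketch. All names here are hypothetical stand-ins: `FakeConf` plays the role of a configuration class that is not `Serializable`, `SerSupplier` the role of a serializable supplier, and `ConfHolder` the role of the implementing class. This is an assumption about how such a contract can be met, not a copy of the Iceberg implementation.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.HashMap;
import java.util.function.Function;
import java.util.function.Supplier;

// Hypothetical stand-in for a configuration class that is NOT Serializable.
final class FakeConf {
  final HashMap<String, String> entries = new HashMap<>();
}

// Hypothetical stand-in for a serializable supplier type.
interface SerSupplier<T> extends Supplier<T>, Serializable {}

// Hypothetical holder that keeps its configuration behind a supplier.
final class ConfHolder implements Serializable {
  private SerSupplier<FakeConf> confSupplier;

  ConfHolder(FakeConf conf) {
    // This lambda captures a non-serializable object, so the holder
    // cannot be serialized until serializeConfWith has been called.
    this.confSupplier = () -> conf;
  }

  FakeConf conf() {
    return confSupplier.get();
  }

  // Pass the current configuration to the function and keep its result.
  void serializeConfWith(Function<FakeConf, SerSupplier<FakeConf>> confSerializer) {
    this.confSupplier = confSerializer.apply(confSupplier.get());
  }
}

final class SerializeConfSketch {
  // One possible serializer: copy the entries into a HashMap (which is
  // serializable) and rebuild the configuration on demand.
  static SerSupplier<FakeConf> snapshot(FakeConf conf) {
    HashMap<String, String> copy = new HashMap<>(conf.entries);
    return () -> {
      FakeConf rebuilt = new FakeConf();
      rebuilt.entries.putAll(copy);
      return rebuilt;
    };
  }

  // Java-serialize the holder to bytes and read it back.
  static ConfHolder roundTrip(ConfHolder holder) throws IOException, ClassNotFoundException {
    ByteArrayOutputStream bytes = new ByteArrayOutputStream();
    try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
      out.writeObject(holder);
    }
    try (ObjectInputStream in =
        new ObjectInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
      return (ConfHolder) in.readObject();
    }
  }

  public static void main(String[] args) throws Exception {
    FakeConf conf = new FakeConf();
    conf.entries.put("fs.defaultFS", "hdfs://namenode:8020");
    ConfHolder holder = new ConfHolder(conf);
    holder.serializeConfWith(SerializeConfSketch::snapshot);
    System.out.println(roundTrip(holder).conf().entries);
  }
}
```

In this sketch the caller, not the holder, decides how the configuration is captured, which is why the method accepts a function rather than a ready-made supplier.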
 
- 
listPrefix
public Iterable<FileInfo> listPrefix(String prefix)
Description copied from interface: SupportsPrefixOperations
Return an iterable of all files under a prefix.
Hierarchical file systems (e.g. HDFS) may impose additional restrictions, such as requiring the prefix to fully match a directory, whereas key/value object stores may allow arbitrary prefixes.
- Specified by:
- listPrefix in interface SupportsPrefixOperations
- Parameters:
- prefix - prefix to list
- Returns:
- iterable of file information
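The difference between the two prefix interpretations mentioned above can be made concrete with a small in-memory model. `PrefixSketch` and both of its methods are hypothetical; they model the distinction only and do not reproduce how HDFS or any object store actually resolves a prefix.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeSet;

final class PrefixSketch {
  // Key/value style: any string prefix matches, even one that ends in the
  // middle of a path segment.
  static List<String> listArbitraryPrefix(TreeSet<String> keys, String prefix) {
    List<String> result = new ArrayList<>();
    // In a sorted set, all keys sharing a prefix are contiguous and start
    // at the first key that is >= the prefix.
    for (String key : keys.tailSet(prefix, true)) {
      if (!key.startsWith(prefix)) {
        break;
      }
      result.add(key);
    }
    return result;
  }

  // Hierarchical style: the prefix is treated as a whole directory name, so
  // only keys below "<prefix>/" match.
  static List<String> listDirectoryPrefix(TreeSet<String> keys, String prefix) {
    String dir = prefix.endsWith("/") ? prefix : prefix + "/";
    return listArbitraryPrefix(keys, dir);
  }

  public static void main(String[] args) {
    TreeSet<String> keys = new TreeSet<>(List.of(
        "data/part-0.parquet",
        "data/part-1.parquet",
        "data-old/part-0.parquet",
        "metadata/v1.json"));
    System.out.println(listArbitraryPrefix(keys, "data"));
    System.out.println(listDirectoryPrefix(keys, "data"));
  }
}
```

With the sample keys, the arbitrary-prefix variant also returns the entry under `data-old/`, while the directory variant returns only the two entries under `data/`. Callers that must work against both kinds of store are safer passing a full directory path.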
 
- 
deletePrefix
public void deletePrefix(String prefix)
Description copied from interface: SupportsPrefixOperations
Delete all files under a prefix.
Hierarchical file systems (e.g. HDFS) may impose additional restrictions, such as requiring the prefix to fully match a directory, whereas key/value object stores may allow arbitrary prefixes.
- Specified by:
- deletePrefix in interface SupportsPrefixOperations
- Parameters:
- prefix - prefix to delete
 
- 
deleteFiles
public void deleteFiles(Iterable<String> pathsToDelete)
Description copied from interface: SupportsBulkOperations
Delete the files at the given paths.
- Specified by:
- deleteFiles in interface SupportsBulkOperations
- Parameters:
- pathsToDelete - the paths to delete
- Throws:
- BulkDeletionFailureException - if at least one file could not be deleted
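One way to satisfy a "fail if at least one deletion failed" contract is to attempt every path, count the failures, and report once at the end. The sketch below does this against the local file system with `java.nio.file`. `BulkDeleteSketch` and `BulkDeleteFailed` are hypothetical stand-ins, and the attempt-all-then-throw policy is an assumption made for illustration: this page only states that the exception is thrown when at least one file cannot be deleted, and a real implementation may differ (for example by deleting concurrently).

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;

// Hypothetical stand-in exception carrying the number of failed deletions.
class BulkDeleteFailed extends RuntimeException {
  final int failedCount;

  BulkDeleteFailed(int failedCount) {
    super("Failed to delete " + failedCount + " file(s)");
    this.failedCount = failedCount;
  }
}

final class BulkDeleteSketch {
  // Try every path, remember how many deletions failed, and report once
  // after all paths have been attempted.
  static void deleteFiles(Iterable<String> pathsToDelete) {
    int failed = 0;
    for (String path : pathsToDelete) {
      try {
        // Throws if the file is missing or cannot be removed.
        Files.delete(Path.of(path));
      } catch (IOException | RuntimeException e) {
        failed++;
      }
    }
    if (failed > 0) {
      throw new BulkDeleteFailed(failed);
    }
  }

  public static void main(String[] args) throws IOException {
    Path existing = Files.createTempFile("bulk", ".tmp");
    String missing = existing + ".missing";
    try {
      deleteFiles(List.of(existing.toString(), missing));
    } catch (BulkDeleteFailed e) {
      System.out.println(e.getMessage());
    }
  }
}
```

Under this policy a single bad path does not prevent the remaining files from being removed, so callers should treat the exception as "some files may remain" rather than "nothing was deleted".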
 
 