Class S3FileIO
- All Implemented Interfaces:
Closeable
,Serializable
,AutoCloseable
,CredentialSupplier
,DelegateFileIO
,FileIO
,SupportsBulkOperations
,SupportsPrefixOperations
Locations used must follow the conventions for S3 URIs (e.g. s3://bucket/path...). URIs with
schemes s3a, s3n, https are also treated as s3 file paths. Using this FileIO with other schemes
will result in ValidationException
.
- See Also:
-
Constructor Summary
ConstructorDescriptionS3FileIO()
No-arg constructor to load the FileIO dynamically.S3FileIO
(SerializableSupplier<software.amazon.awssdk.services.s3.S3Client> s3) Constructor with custom s3 supplier and S3FileIO properties.S3FileIO
(SerializableSupplier<software.amazon.awssdk.services.s3.S3Client> s3, S3FileIOProperties s3FileIOProperties) Constructor with custom s3 supplier and S3FileIO properties. -
Method Summary
Modifier and TypeMethodDescriptionsoftware.amazon.awssdk.services.s3.S3Client
client()
void
close()
Close File IO to release underlying resources.void
deleteFile
(String path) Delete the file at the given path.void
deleteFiles
(Iterable<String> paths) Deletes the given paths in a batched manner.void
deletePrefix
(String prefix) This method provides a "best-effort" to delete all objects under the given prefix.protected void
finalize()
Returns the credential stringvoid
initialize
(Map<String, String> props) Initialize File IO from catalog properties.listPrefix
(String prefix) Return an iterable of all files under a prefix.newInputFile
(String path) Get aInputFile
instance to read bytes from the file at the given path.newInputFile
(String path, long length) Get aInputFile
instance to read bytes from the file at the given path, with a known file length.newOutputFile
(String path) Get aOutputFile
instance to write bytes to the file at the given path.Returns the property map used to configure this FileIOMethods inherited from class java.lang.Object
clone, equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface org.apache.iceberg.io.FileIO
deleteFile, deleteFile, newInputFile, newInputFile, newInputFile
-
Constructor Details
-
S3FileIO
public S3FileIO()No-arg constructor to load the FileIO dynamically.All fields are initialized by calling
initialize(Map)
later. -
S3FileIO
Constructor with custom s3 supplier and S3FileIO properties.Calling
initialize(Map)
will overwrite information set in this constructor.- Parameters:
s3
- s3 supplier
-
S3FileIO
public S3FileIO(SerializableSupplier<software.amazon.awssdk.services.s3.S3Client> s3, S3FileIOProperties s3FileIOProperties) Constructor with custom s3 supplier and S3FileIO properties.Calling
initialize(Map)
will overwrite information set in this constructor.- Parameters:
s3
- s3 suppliers3FileIOProperties
- S3 FileIO properties
-
-
Method Details
-
newInputFile
Description copied from interface:FileIO
Get aInputFile
instance to read bytes from the file at the given path.- Specified by:
newInputFile
in interfaceFileIO
-
newInputFile
Description copied from interface:FileIO
Get aInputFile
instance to read bytes from the file at the given path, with a known file length.- Specified by:
newInputFile
in interfaceFileIO
-
newOutputFile
Description copied from interface:FileIO
Get aOutputFile
instance to write bytes to the file at the given path.- Specified by:
newOutputFile
in interfaceFileIO
-
deleteFile
Description copied from interface:FileIO
Delete the file at the given path.- Specified by:
deleteFile
in interfaceFileIO
-
properties
Description copied from interface:FileIO
Returns the property map used to configure this FileIO- Specified by:
properties
in interfaceFileIO
-
deleteFiles
Deletes the given paths in a batched manner.The paths are grouped by bucket, and deletion is triggered when we either reach the configured batch size or have a final remainder batch for each bucket.
- Specified by:
deleteFiles
in interfaceSupportsBulkOperations
- Parameters:
paths
- paths to delete- Throws:
BulkDeletionFailureException
- in case of failure to delete at least 1 file
-
listPrefix
Description copied from interface:SupportsPrefixOperations
Return an iterable of all files under a prefix.Hierarchical file systems (e.g. HDFS) may impose additional restrictions like the prefix must fully match a directory whereas key/value object stores may allow for arbitrary prefixes.
- Specified by:
listPrefix
in interfaceSupportsPrefixOperations
- Parameters:
prefix
- prefix to list- Returns:
- iterable of file information
-
deletePrefix
This method provides a "best-effort" to delete all objects under the given prefix.Bulk delete operations are used and no reattempt is made for deletes if they fail, but will log any individual objects that are not deleted as part of the bulk operation.
- Specified by:
deletePrefix
in interfaceSupportsPrefixOperations
- Parameters:
prefix
- prefix to delete
-
client
public software.amazon.awssdk.services.s3.S3Client client() -
getCredential
Description copied from interface:CredentialSupplier
Returns the credential string- Specified by:
getCredential
in interfaceCredentialSupplier
-
initialize
Description copied from interface:FileIO
Initialize File IO from catalog properties.- Specified by:
initialize
in interfaceFileIO
- Parameters:
props
- catalog properties
-
close
public void close()Description copied from interface:FileIO
Close File IO to release underlying resources.Calling this method is only required when this FileIO instance is no longer expected to be used, and the resources it holds need to be explicitly released to avoid resource leaks.
-
finalize
-