Description
Currently, s3a buffers files on local disk(s) before uploading them. This JIRA investigates adding a memory-based upload implementation.
The motivation is performance: buffering in memory would benefit users with high network bandwidth to S3 (EC2?) and users who run Hadoop directly on an S3-compatible object store. (FYI: my contributions are made on behalf of Amplidata.)
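To make the idea concrete, here is a minimal sketch of what memory-based buffering could look like. It is not the implementation proposed in this issue: `PartUploader` and `MemoryBufferedOutputStream` are hypothetical names, and the callback merely stands in for whatever call would send one part to the object store. The sketch assumes data is collected in a byte array and handed off whenever a configurable part size is reached, so nothing touches local disk.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;

/** Hypothetical callback standing in for a single "upload part" request. */
interface PartUploader {
    void uploadPart(int partNumber, byte[] data) throws IOException;
}

/** Buffers writes in memory and hands off a part each time the buffer fills. */
class MemoryBufferedOutputStream extends OutputStream {
    private final int partSize;
    private final PartUploader uploader;
    private final ByteArrayOutputStream buffer;
    private int nextPartNumber = 1;
    private boolean closed = false;

    MemoryBufferedOutputStream(int partSize, PartUploader uploader) {
        if (partSize <= 0) {
            throw new IllegalArgumentException("partSize must be positive");
        }
        this.partSize = partSize;
        this.uploader = uploader;
        this.buffer = new ByteArrayOutputStream(partSize);
    }

    @Override
    public void write(int b) throws IOException {
        ensureOpen();
        buffer.write(b);
        if (buffer.size() >= partSize) {
            flushPart();
        }
    }

    @Override
    public void write(byte[] src, int off, int len) throws IOException {
        ensureOpen();
        while (len > 0) {
            // The buffer is flushed as soon as it is full, so room is always > 0.
            int room = partSize - buffer.size();
            int n = Math.min(room, len);
            buffer.write(src, off, n);
            off += n;
            len -= n;
            if (buffer.size() >= partSize) {
                flushPart();
            }
        }
    }

    /** Sends the buffered bytes as one part and empties the buffer. */
    private void flushPart() throws IOException {
        uploader.uploadPart(nextPartNumber++, buffer.toByteArray());
        buffer.reset();
    }

    private void ensureOpen() throws IOException {
        if (closed) {
            throw new IOException("stream is closed");
        }
    }

    /** Uploads any remaining bytes as a final, possibly shorter, part. */
    @Override
    public void close() throws IOException {
        if (closed) {
            return;
        }
        closed = true;
        if (buffer.size() > 0) {
            flushPart();
        }
    }
}
```

With a part size of 4 bytes, writing 10 bytes and closing the stream invokes the callback three times, with parts of 4, 4, and 2 bytes. The trade-off in this sketch is heap usage: each open stream holds up to one part in memory, plus the copy made by `toByteArray()`.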
Attachments
Issue Links
- depends upon
  - HADOOP-10400 Incorporate new S3A FileSystem implementation (Closed)
  - HADOOP-11463 Replace method-local TransferManager object with S3AFileSystem#transfers (Closed)
- is related to
  - HADOOP-13560 S3ABlockOutputStream to support huge (many GB) file writes (Resolved)