Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
HADOOP-13345
-
None
-
None
Description
HADOOP-13449 adds support for DynamoDBMetadataStore.
The code currently supports two options for choosing DynamoDB table names:
1. Use name of each s3 bucket and auto-create a DynamoDB table for each.
2. Configure a table name in the fs.s3a.s3guard.ddb.table parameter.
However, if a user sets fs.s3a.s3guard.ddb.table and accesses multiple buckets, DynamoDBMetadataStore does not properly differentiate between paths belonging to different buckets. For example, it would treat s3a://bucket-a/path1 as the same as s3a://bucket-b/path1.
Goals for this JIRA:
- Allow for a "one DynamoDB table per cluster" configuration. If a user accesess multiple buckets with that single table, it should work correctly.
- Explain which credentials are used for DynamoDB. Currently each S3AFileSystem has its own DynamoDBMetadataStore, which uses the credentials from the S3A fs. We at least need to document this behavior.
- Document any other limitations etc. in the s3guard.md site doc.
Attachments
Attachments
Issue Links
- depends upon
-
HADOOP-13589 S3Guard: Allow execution of all S3A integration tests with S3Guard enabled.
- Resolved
- is cloned by
-
HADOOP-14013 S3Guard: fix multi-bucket integration tests
- Resolved
- is depended upon by
-
HADOOP-13998 Merge initial S3guard release into trunk
- Resolved
- is related to
-
HADOOP-13336 S3A to support per-bucket configuration
- Resolved
-
HADOOP-17401 GCS to support per-bucket configuration
- Resolved