[HADOOP-13345] S3Guard: Improved Consistency for S3A - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 2.8.1
Fix Version/s: 2.9.0, 3.0.0-beta1
Component/s: fs/s3
Labels:
None

Target Version/s:

2.9.0
Release Note:

Hide
S3Guard (pronounced see-guard) is a new feature for the S3A connector to Amazon S3, which uses DynamoDB for a high performance and consistent metadata repository. Essentially: S3Guard caches directory information, so your S3A clients get faster lookups and resilience to inconsistency between S3 list operations and the status of objects. When files are created, with S3Guard, they'll always be found.

S3Guard does not address update consistency: if a file is updated, while the directory information will be updated, calling open() on the path may still return the old data. Similarly, deleted objects may also potentially be opened.

Please consult the S3Guard documentation in the Amazon S3 section of our documentation.

Note: part of this update includes moving to a new version of the AWS SDK 1.11, one which includes the Dynamo DB client and its a shaded version of Jackson 2. The large aws-sdk-bundle JAR is needed to use the S3A client with or without S3Guard enabled. The good news: because Jackson is shaded, there will be no conflict between any Jackson version used in your application and that which the AWS SDK needs.

Show
S3Guard (pronounced see-guard) is a new feature for the S3A connector to Amazon S3, which uses DynamoDB for a high performance and consistent metadata repository. Essentially: S3Guard caches directory information, so your S3A clients get faster lookups and resilience to inconsistency between S3 list operations and the status of objects. When files are created, with S3Guard, they'll always be found. S3Guard does not address update consistency: if a file is updated, while the directory information will be updated, calling open() on the path may still return the old data. Similarly, deleted objects may also potentially be opened. Please consult the S3Guard documentation in the Amazon S3 section of our documentation. Note: part of this update includes moving to a new version of the AWS SDK 1.11, one which includes the Dynamo DB client and its a shaded version of Jackson 2. The large aws-sdk-bundle JAR is needed to use the S3A client with or without S3Guard enabled. The good news: because Jackson is shaded, there will be no conflict between any Jackson version used in your application and that which the AWS SDK needs.

Description

This issue proposes S3Guard, a new feature of S3A, to provide an option for a stronger consistency model than what is currently offered. The solution coordinates with a strongly consistent external store to resolve inconsistencies caused by the S3 eventual consistency model.

Attachments

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HADOOP-13345.prototype1.patch
06/Jul/16 21:25
76 kB
Chris Nauroth
s3c.001.patch
14/Jul/16 21:14
61 kB
Lei (Eddy) Xu
S3C-ConsistentListingonS3-Design.pdf
14/Jul/16 21:14
245 kB
Lei (Eddy) Xu
S3GuardImprovedConsistencyforS3A.pdf
06/Jul/16 21:25
431 kB
Chris Nauroth
S3GuardImprovedConsistencyforS3AV2.pdf
01/Aug/16 23:47
328 kB
Chris Nauroth

Issue Links

depends upon

HADOOP-14432 S3A copyFromLocalFile to be robust, tested

Resolved

HADOOP-13912 S3a Multipart Committer (avoid rename)

Resolved

HADOOP-13852 hadoop build to allow hadoop version property to be explicitly set

Resolved

incorporates

HADOOP-14838 backport S3guard to branch-2

Resolved

is depended upon by

SPARK-17593 list files on s3 very slow

Resolved

HADOOP-13204 Über-jira: S3a phase III: scale and tuning

Resolved

is related to

HADOOP-11487 FileNotFound on distcp to s3n/s3a due to creation inconsistency

Resolved

SPARK-18883 FileNotFoundException on _temporary directory

Resolved

is superceded by

HADOOP-14825 Über-JIRA: S3Guard Phase II: Hadoop 3.1 features

Resolved

supercedes

HADOOP-14161 Failed to rename file in S3A during FileOutputFormat commitTask

Resolved

(1 is depended upon by, 2 is related to, 1 is superceded by, 1 supercedes)

Sub-Tasks

1.

Support running isolated unit tests separate from AWS integration tests.

Resolved

Chris Nauroth

2.

S3Guard: Define MetadataStore interface.

Resolved

Chris Nauroth

3.

S3Guard: Implement DynamoDBMetadataStore.

Resolved

Mingliang Liu

4.

S3Guard: Implement access policy providing strong consistency with S3 as source of truth.

Closed

Unassigned

5.

S3Guard: Implement access policy using metadata store as source of truth.

Closed

Unassigned

6.

S3Guard: Implement access policy for intra-client consistency with in-memory metadata store.

Resolved

Aaron Fabbri

7.

S3Guard: Instrument new functionality with Hadoop metrics.

Resolved

Ai Deng

8.

S3Guard: Write end user docs, change table autocreate default.

Resolved

Aaron Fabbri

9.

S3Guard: create basic contract tests for MetadataStore implementations

Resolved

Aaron Fabbri

10.

S3Guard: implement move() for LocalMetadataStore, add unit tests

Resolved

Aaron Fabbri

11.

S3Guard: Allow execution of all S3A integration tests with S3Guard enabled.

Resolved

Steve Loughran

12.

S3Guard: S3AFileSystem Integration with MetadataStore

Resolved

Aaron Fabbri

13.

S3Guard: Provide command line tools to manipulate metadata store.

Resolved

Lei (Eddy) Xu

14.

Change PathMetadata to hold S3AFileStatus instead of FileStatus.

Resolved

Lei (Eddy) Xu

15.

S3Guard: better support for multi-bucket access

Resolved

Aaron Fabbri

16.

S3Guard: add delete tracking

Resolved

Aaron Fabbri

17.

s3guard: add inconsistency injection, integration tests

Resolved

Aaron Fabbri

18.

s3guard to log choice of metadata store at debug

Resolved

Mingliang Liu

19.

S3Guard: fix TestDynamoDBMetadataStore when fs.s3a.s3guard.ddb.table is set

Resolved

Aaron Fabbri

20.

s3guard: ITestS3AFileOperationCost.testFakeDirectoryDeletion failure

Resolved

Mingliang Liu

21.

dynamodb dependency -> compile

Resolved

Mingliang Liu

22.

DynamoDBMetadataStore to handle DDB throttling failures through retry policy

Resolved

Aaron Fabbri

23.

tune dynamodb client & tests

Resolved

Steve Loughran

24.

s3guard: improve S3AFileStatus#isEmptyDirectory handling

Resolved

Aaron Fabbri

25.

S3Guard: Existing tables may not be initialized correctly in DynamoDBMetadataStore

Resolved

Mingliang Liu

26.

S3Guard: NPE when table is already populated in dynamodb and user specifies "fs.s3a.s3guard.ddb.table.create=false"

Closed

Mingliang Liu

27.

S3AGuard: Use BatchWriteItem in DynamoDBMetadataStore#put()

Resolved

Mingliang Liu

28.

S3Guard: S3AFileSystem::listLocatedStatus() to employ MetadataStore

Resolved

Mingliang Liu

29.

S3Guard: DynamoDBMetadataStore#move() could be throwing exception due to BatchWriteItem limits

Resolved

Mingliang Liu

30.

Mock bucket locations in MockS3ClientFactory

Resolved

Mingliang Liu

31.

S3guard: replace dynamo.describe() call in init with more efficient query

Closed

Mingliang Liu

32.

Initialize DynamoDBMetadataStore without associated S3AFileSystem

Resolved

Mingliang Liu

33.

Add ability to start DDB local server in every test

Resolved

Mingliang Liu

34.

s3guard: add a version marker to every table

Resolved

Steve Loughran

35.

S3Guard CLI: Add documentation

Resolved

Aaron Fabbri

36.

s3guard cli: make tests easier to run and address failure

Resolved

Sean Mackrory

37.

Merge initial S3guard release into trunk

Resolved

Steve Loughran

38.

cli to list info about a bucket (S3guard or not)

Resolved

Unassigned

39.

Handled dynamo exceptions in translateException

Resolved

Unassigned

40.

S3Guard: fix multi-bucket integration tests

Resolved

Aaron Fabbri

41.

Optimize dirListingUnion

Resolved

Sean Mackrory

42.

S3Guard: DynamoDBMetadataStore logs nonsense region

Resolved

Sean Mackrory

43.

Implicitly creating DynamoDB table ignores endpoint config

Resolved

Sean Mackrory

44.

S3Guard: intermittent duplicate item keys failure

Resolved

Mingliang Liu

45.

CLI command to prune old metadata

Resolved

Sean Mackrory

46.

Metastore destruction test creates table without version marker

Resolved

Sean Mackrory

47.

S3Guard: link docs from index, fix typos

Resolved

Aaron Fabbri

48.

Fix breaking link in s3guard.md

Resolved

Mingliang Liu

49.

Drop unnecessary type assertion and cast

Resolved

Sean Mackrory

50.

Allow users to specify region for DynamoDB table instead of endpoint

Resolved

Sean Mackrory

51.

Rethink S3GuardTool options

Resolved

Sean Mackrory

52.

s3guard: regression in dirListingUnion

Resolved

Aaron Fabbri

53.

ITestS3GuardListConsistency fails intermittently

Resolved

Mingliang Liu

54.

In S3AFileSystem, make getAmazonClient() package private; export getBucketLocation()

Resolved

Steve Loughran

55.

s3guard tool tests aren't isolated; can't run in parallel

Resolved

Sean Mackrory

56.

ITestS3ACredentialsInURL sometimes fails

Resolved

Sean Mackrory

57.

Simplify DynamoDBClientFactory for creating Amazon DynamoDB clients

Resolved

Mingliang Liu

58.

s3guard: CLI diff non-empty after import on new table

Resolved

Sean Mackrory

59.

Ensure GenericOptionParser is used for S3Guard CLI

Resolved

Sean Mackrory

60.

Add S3Guard.dirListingUnion in S3AFileSystem#listFiles, listLocatedStatus

Closed

Unassigned

61.

S3GuardTool tests should not run if S3Guard is not set up

Resolved

Sean Mackrory

62.

S3Guard: import does not import empty directory

Resolved

Sean Mackrory

63.

Add validation of DynamoDB region

Resolved

Sean Mackrory

64.

DynamoDB client should waitForActive on existing tables

Resolved

Sean Mackrory

65.

Add s3guardtool dump command

Resolved

Unassigned

66.

S3Guard: DynamoDBMetadataStore::move() should populate ancestor directories of destination paths

Resolved

Mingliang Liu

67.

S3Guard: S3AFileSystem::rename() should move non-listed sub-directory entries in metadata store

Resolved

Mingliang Liu

68.

S3Guard: ITestS3AConcurrentOps is not cleaning up test data

Resolved

Mingliang Liu

69.

TestS3GuardTool hangs/fails when offline: it's an IT test

Resolved

Mingliang Liu

70.

S3Guard: S3AFileSystem::listFiles() to employ MetadataStore

Resolved

Mingliang Liu

71.

S3Guard: DynamoDBMetadata::prune() should self interrupt correctly

Resolved

Mingliang Liu

72.

TestDynamoDBMetadataStore is broken unless we can fail faster without a table version

Resolved

Sean Mackrory

73.

ITestS3GuardListConsistency failure w/ Local, authoritative metadata store

Resolved

Aaron Fabbri

74.

S3Guard: S3GuardTool to support provisioning existing metadata store

Resolved

Steve Loughran

75.

s3guard will set file length to -1 on a putObjectDirect(stream, -1) call

Resolved

Steve Loughran

76.

ITestS3GuardConcurrentOps.testConcurrentTableCreations fails without table name configured

Resolved

Sean Mackrory

77.

Play nice with ITestS3AEncryptionSSEC

Resolved

Sean Mackrory

78.

create() does not notify metadataStore of parent directories or ensure they're not existing files

Resolved

Sean Mackrory

79.

S3Guard: Improve FNFE message when opening a stream

Resolved

Aaron Fabbri

80.

make InconsistentAmazonS3Client usable in downstream tests

Resolved

Aaron Fabbri

81.

Ensure deleted parent directory tombstones are overwritten when implicitly recreated

Resolved

Sean Mackrory

82.

DirListingMetadata precondition failure messages to include path at fault

Resolved

Steve Loughran

83.

s3guard w/ failure injection: listStatus fails after renaming file into directory

Resolved

Sean Mackrory

84.

ITestS3GuardConcurrentOps requires explicit DynamoDB table name to be configured

Resolved

Sean Mackrory

85.

Findbugs warning in LocalMetadataStore

Resolved

Sean Mackrory

86.

ProvidedFileStatusIterator#next() may throw IndexOutOfBoundsException

Resolved

Aaron Fabbri

87.

simplify mkdirs() after S3Guard delete tracking change

Resolved

Sean Mackrory

88.

InconsistentAmazonS3Client adds extra paths to listStatus() after delete.

Resolved

Sean Mackrory

89.

ITestS3GuardListConsistency is too slow

Resolved

Aaron Fabbri

90.

S3Guard: issues running parallel tests w/ S3N

Resolved

Aaron Fabbri

91.

LocalDynamoDB missing from latest AWS SDK releases

Resolved

Steve Loughran

92.

S3Guard: optimize create codepath

Resolved

Aaron Fabbri

93.

add a predicate/option to probe an S3A FS for being consistent

Resolved

Unassigned

94.

ITestS3GuardConcurrentOps failing with -Ddynamodblocal -Ds3guard

Resolved

Steve Loughran

95.

ITestS3AEncryptionSSEC failing in parallel s3guard runs

Resolved

Steve Loughran

96.

Review S3guard docs & code prior to merge

Resolved

Steve Loughran

0%

Original Estimate - 24h

Remaining Estimate - 24h

97.

S3Guard premerge changes: java 7 build & test tuning

Resolved

Steve Loughran

98.

s3guard diff demand creates a new table

Resolved

Unassigned

99.

hadoop-aws shell profile not being built

Resolved

Allen Wittenauer

100.

S3Guard: handle provisioning failure through backoff & retry (& metrics)

Resolved

Unassigned

101.

s3guard usage calls function incorrectly

Resolved

Allen Wittenauer

102.

backport S3guard to branch-2

Resolved

Steve Loughran

Activity

People

Assignee:: Chris Nauroth

Reporter:: Chris Nauroth

Votes:: 8 Vote for this issue

Watchers:: 73 Start watching this issue

Dates

Created:: 06/Jul/16 21:23

Updated:: 03/Feb/18 03:08

Resolved:: 01/Sep/17 14:49

Time Tracking

Estimated:

24h

Remaining:

24h

Logged:

Not Specified

Include sub-tasks