Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-15619

Über-JIRA: S3Guard Phase IV: Hadoop 3.3 features

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: fs/s3
    • Labels:
      None
    • Target Version/s:

      Description

      Features for S3Guard for Hadoop 3.3. Goal: take the experimental tag off

        Attachments

          Issue Links

          1.
          Add more s3guard metrics Sub-task Open Gabor Bota
          2.
          S3guard metadata stores to support millions of entries Sub-task Open Unassigned
          3.
          add bulk delete call to metastore API & DDB impl Sub-task Resolved Unassigned
          4.
          Improve DynamoDB schema update story Sub-task Open Sean Mackrory
          5.
          Ensure controls in-place to prevent clients with significant clock skews pruning aggressively Sub-task Open Unassigned
          6.
          s3guard DynamoDB resource not found: tables not ACTIVE state after initial connection Sub-task Resolved Unassigned
          7.
          S3Guard to support creating on-demand DDB tables Sub-task Resolved Steve Loughran
          8.
          s3guard set-capacity to not retry on an access denied exception Sub-task Resolved Steve Loughran
          9.
          S3Guard instrumentation to include cost of DynamoDB ops as metric Sub-task Open Unassigned
          10.
          S3Guard: Provide custom FileSystem Statistics. Sub-task Resolved Unassigned
          11.
          increase performance of s3guard import command Sub-task Open Unassigned
          12.
          improvements to S3GuardTool destroy command Sub-task Open Gabor Bota
          13.
          s3guard set-capacity command to fail fast if bucket is unguarded Sub-task Resolved Gabor Bota
          14.
          S3Guard, MetadataStore to support atomic create(path, overwrite=false) Sub-task Resolved Unassigned
          15.
          S3AFileSystem.innerRename() to size metadatastore lists better Sub-task Resolved Unassigned
          16.
          Scheme assertion in S3Guard DynamoDBMetadataStore::checkPath is unnecessarily restrictive Sub-task Open Unassigned
          17.
          S3Guard: remove "experimental" from documentation Sub-task Open Steve Loughran
          18.
          Possible for modified configuration to leak into metadatastore in S3GuardTool Sub-task Open Unassigned
          19.
          S3Guard: DynamoDB can go out of sync with S3AFileSystem.delete() Sub-task Resolved Steve Loughran
          20.
          S3Guard store becomes inconsistent after partial failure of rename Sub-task Resolved Steve Loughran
          21.
          S3Guard: make short-circuit getFileStatus() configurable Sub-task Resolved Gabor Bota
          22.
          hadoop fs -mkdir -p path-ending-with-slash/ fails with s3guard Sub-task Open Steve Loughran
          23.
          S3Guard: Implement time-based (TTL) expiry for Authoritative Directory Listing Sub-task Resolved Gabor Bota
          24.
          S3Guard to self update on directory listings of S3 Sub-task Open Gabor Bota
          25.
          reduce/tune read failure fault injection on inconsistent client Sub-task Open Unassigned
          26.
          S3guard mistakes root URI without / as non-absolute path Sub-task Open Unassigned
          27.
          s3guard bucket-info will create s3guard table if FS is set to do this automatically Sub-task Resolved Gabor Bota
          28.
          ITestS3GuardTool failures in testDestroyNoBucket() Sub-task Resolved Gabor Bota
          29.
          Move S3Guard LocalMetadataStore constants to org.apache.hadoop.fs.s3a.Constants Sub-task Resolved Gabor Bota
          30.
          S3guard: add inconsistency detection metrics Sub-task Open Gabor Bota
          31.
          S3Guard: document how to deal with non-S3Guard processes writing data to S3Guarded buckets Sub-task Resolved Gabor Bota
          32.
          ClassCastException: S3AFileSystem cannot be cast to MockS3AFileSystem when fs caching is disabled Sub-task Resolved Gabor Bota
          33.
          LocalMetadataStore put() does not retain isDeleted in parent listing Sub-task Resolved Gabor Bota
          34.
          ITestS3AEmptyDirectory#testDirectoryBecomesEmpty fails when running with dynamo Sub-task Resolved Gabor Bota
          35.
          ITestS3GuardListConsistency#testConsistentListAfterDelete fails when running with dynamo Sub-task Resolved Unassigned
          36.
          NPE in DynamoDBMetadataStore.lambda$listChildren for root + auth S3Guard Sub-task Resolved Gabor Bota
          37.
          Intermittent failures of some S3A tests with S3Guard in parallel test runs Sub-task Resolved Unassigned
          38.
          Improve throttling on S3Guard DDB batch retries Sub-task Open Unassigned
          39.
          DynamoDB table Update can fail S3A FS init Sub-task Resolved Steve Loughran
          40.
          tag S3GuardTool entry points as limitedPrivate("management-tools")/evolving Sub-task Open Steve Loughran
          41.
          s3guard init and destroy command will create/destroy tables if ddb.table & region are set Sub-task Resolved Gabor Bota
          42.
          S3Guard testConcurrentTableCreations to set r & w capacity == 0 Sub-task Resolved lqjacklee
          43.
          s3guard bucket-info command to not print a stack trace on bucket-not-found Sub-task Resolved Adam Antal
          44.
          ITestDynamoDBMetadataStore can leak (large) DDB tables in test failures/timeout Sub-task Resolved Steve Loughran
          45.
          getFileChecksum() needs to adopt S3Guard Sub-task In Progress lqjacklee
          46.
          ITestDynamoDBMetadataStore should check if test ddb table set properly before initializing the test Sub-task Resolved Gabor Bota
          47.
          Should be able to set empty directory flag to TRUE in DynamoDBMetadataStore#innerGet when using authoritative directory listings Sub-task Resolved Gabor Bota
          48.
          S3Guard: Better support for out-of-band operations Sub-task Resolved Gabor Bota
          49.
          IndexOutOfBoundsException in ITestS3GuardToolLocal Sub-task Resolved Adam Antal
          50.
          S3Guard: use object version or etags to protect against inconsistent read after replace/overwrite Sub-task Resolved Ben Roling
          51.
          S3Guard to support on-demand DDB tables Sub-task Resolved Steve Loughran
          52.
          S3Guard: Handle OOB deletions and creation of a file which has a tombstone marker Sub-task Resolved Gabor Bota
          53.
          S3Guard: Optimize performance of handling OOB operations in non-authoritative mode Sub-task Open Unassigned
          54.
          S3Guard: NPE in DynamoDBMetadataStore.lambda$listChildren Sub-task Resolved Gabor Bota
          55.
          ITestS3GuardToolDynamoDB test failures Sub-task Resolved Steve Loughran
          56.
          S3Guard: fail write that doesn't update metadata store Sub-task Resolved Ben Roling
          57.
          Add a way to declare a toplevel path in a non-auth FS as auth Sub-task Resolved Sean Mackrory
          58.
          Use configurable dynamo table name prefix in S3Guard tests Sub-task Resolved Ben Roling
          59.
          transient failure of ITestS3GuardToolDynamoDB.testDestroyUnknownTable Sub-task Resolved Unassigned
          60.
          S3Guard: Implement time-based (TTL) expiry for entries (and tombstones) Sub-task Resolved Gabor Bota
          61.
          S3Guard: Retry failed read with backoff in Authoritative mode when file can be opened Sub-task Resolved Steve Loughran
          62.
          S3Guard: Remove LocalMetadataStore Sub-task Resolved Unassigned
          63.
          "Wrong Billing mode" test failure in ITestS3GuardToolDynamoDB Sub-task Resolved Steve Loughran
          64.
          Add option for a prefix to put in front of every s3guard table Sub-task Open Unassigned
          65.
          DynamoDBMetadataStore.getVersionMarkerItem() to log at info/warn on retry Sub-task Resolved Gabor Bota
          66.
          ZookeeperMetadataStore: Use Zookeeper as S3Guard backend store Sub-task Open Unassigned
          67.
          ITestS3GuardToolDynamoDB.testDynamoDBInitDestroyCycle fails if test bucket isn't on demand Sub-task Open Steve Loughran
          68.
          S3Guard DDB store prune() doesn't translate AWS exceptions to IOEs Sub-task Resolved Steve Loughran
          69.
          S3Guard table destroy to map IllegalArgumentExceptions to IOEs Sub-task Resolved Steve Loughran
          70.
          S3A list operation doesn't pick up etags from results Sub-task Resolved Steve Loughran
          71.
          S3AFileSystem copyFile to propagate etag/version from getObjectMetadata to copy request Sub-task Open Unassigned
          72.
          ITestS3AMetadataPersistenceException failure Sub-task Resolved Unassigned
          73.
          S3A tombstones can confuse empty directory status Sub-task Resolved Steve Loughran
          74.
          Clock skew can cause S3Guard to think object metadata is out of date Sub-task Open Unassigned
          75.
          Pass ITtlTimeProvider instance in initialize method in MetadataStore interface Sub-task Resolved Gabor Bota
          76.
          S3A: Avoid inconsistencies between DDB and S3 Sub-task Resolved Steve Loughran
          77.
          S3Guard Diff tool to list+ compare the etag and version fields Sub-task Resolved Unassigned
          78.
          S3Guard init command uses global settings, not those of target bucket Sub-task Reopened Steve Loughran
          79.
          Hadoop S3Guard Prune command to support a -tombstone option. Sub-task Resolved Steve Loughran
          80.
          ITestDynamoDBMetadataStore.testProvisionTable times out intermittently Sub-task Resolved Steve Loughran
          81.
          S3a getFileStatus to update DDB if an S3 query returns etag/versionID Sub-task Open Unassigned
          82.
          mark DynamoDBMetadataStore.deleteTrackingValueMap as final Sub-task Resolved kevin su
          83.
          Transient failure of ITestS3ARemoteFileChanged Sub-task Open Unassigned
          84.
          ITestS3GuardOutOfBandOperations.deleteAfterTombstoneExpiryOobCreate failure Sub-task Resolved Unassigned
          85.
          DynamoDBMetaStore deleteSubtree to delete leaf nodes first Sub-task Resolved Unassigned
          86.
          S3AFilesystem.delete to incrementally update s3guard with deletions Sub-task Resolved Steve Loughran
          87.
          S3Guard: Filter expired entries and tombstones when listing with MetadataStore#listChildren Sub-task Resolved Gabor Bota
          88.
          Fix ITestS3GuardToolLocal#testInitNegativeRead test failure Sub-task Open Steve Loughran
          89.
          Intermittent failure of ITestS3GuardToolDynamoDB#testDynamoDBInitDestroyCycle Sub-task Open Unassigned
          90.
          S3Guard in auth mode doesn't raise AccessDeniedException on read of 0-byte file Sub-task Resolved Unassigned
          91.
          findbugs warning on LocalMetadataStore.ttlTimeProvider sync Sub-task Resolved Steve Loughran
          92.
          S3Guard prune to only remove auth dir marker if files (not tombstones) are removed Sub-task Open Unassigned
          93.
          S3Guard ProgressiveRenameTracker to mark dest dir as authoritative on success Sub-task Open Unassigned
          94.
          Intermittent failure of ITestS3GuardConcurrentOps#testConcurrentTableCreations Sub-task Open Unassigned
          95.
          ITestS3GuardDDBRootOperations.test_300_MetastorePrune needs to set region Sub-task Resolved Steve Loughran
          96.
          S3A to warn or fail if S3Guard is disabled Sub-task Open Gabor Bota
          97.
          S3Guard operations log has tombstone/PUT swapped Sub-task Resolved Steve Loughran
          98.
          Avoid/handle cached 404s during S3A file creation Sub-task Resolved Steve Loughran
          99.
          s3guard auth path checks only check against unqualified source path Sub-task Resolved Steve Loughran
          100.
          Race condition in DDB table init and waiting threads Sub-task Resolved Gabor Bota
          101.
          s3guard prune command doesn't get AWS auth chain from FS Sub-task Resolved Steve Loughran
          102.
          Region must be provided when requesting session credentials or SdkClientException will be thrown Sub-task Resolved Gabor Bota
          103.
          ITestS3GuardDDBRootOperations. test_100_FilesystemPrune failure Sub-task Resolved Steve Loughran
          104.
          intermittent failure of ITestCommitOperations: too many s3guard writes Sub-task Open Unassigned
          105.
          ITestDynamoDBMetadataStoreScale fails when throttled. Sub-task Patch Available Steve Loughran

            Activity

              People

              • Assignee:
                stevel@apache.org Steve Loughran
                Reporter:
                stevel@apache.org Steve Loughran
              • Votes:
                0 Vote for this issue
                Watchers:
                7 Start watching this issue

                Dates

                • Created:
                  Updated: