  1. Hadoop Common
  2. HADOOP-15620

Über-jira: S3A phase VI: Hadoop 3.3 features


Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.2.0
    • Fix Version/s: 3.3.0
    • Component/s: fs/s3
    • Labels: None
      Lots of enhancements to the S3A code, including
      * Delegation Token support
      * better handling of 404 caching
      * S3Guard performance and resilience improvements

    Sub-Tasks

      1. s3a create(overwrite=true) to only look for dir/ and list entries, not file [Sub-task, Resolved, Steve Loughran]
      2. S3A init hangs if you try to connect while the system is offline [Sub-task, Resolved, Unassigned]
      3. S3AInputStream to implement CanUnbuffer [Sub-task, Resolved, Sahil Takiar]
      4. S3A: Consider using TransferManager.download for copyToLocalFile [Sub-task, Resolved, Unassigned]
      5. Parallelize S3A directory deletes [Sub-task, Resolved, Unassigned]
      6. Add support for S3 Select to S3A [Sub-task, Resolved, Steve Loughran]
      7. S3A to support Delegation Tokens [Sub-task, Resolved, Steve Loughran]
      8. Stabilise/formalise the JSON _SUCCESS format used in the S3A committers [Sub-task, Resolved, Unassigned]
      9. Add S3A implementation of FSMainOperationsBaseTest [Sub-task, Resolved, Steve Loughran]
      10. S3a to support get/set permissions through S3 object tags [Sub-task, Resolved, Unassigned]
      11. S3a rename() to copy files in a directory in parallel [Sub-task, Resolved, Unassigned]
      12. Add HTrace to the s3a connector [Sub-task, Resolved, Madhawa Kasun Gunasekara]
      13. S3A should allow renaming to a pre-existing destination directory to move the source path under that directory, similar to HDFS. [Sub-task, Resolved, Unassigned]
      14. fs -expunge to take a filesystem [Sub-task, Resolved, Shweta]
      15. S3A Retry policy to retry on NoResponseException [Sub-task, Resolved, Steve Loughran]
      16. S3A client raising ConnectionPoolTimeoutException in test [Sub-task, Resolved, Unassigned]
      17. Bulk commits of S3A MPUs place needless excessive load on S3 & S3Guard [Sub-task, Resolved, Steve Loughran]
      18. S3A log message on rm s3a://bucket/ not intuitive [Sub-task, Resolved, Gabor Bota]
      19. S3aUtils.getEncryptionAlgorithm() always logs@Debug "Using SSE-C" [Sub-task, Resolved, Unassigned]
      20. S3A warning of obsolete encryption key which is never used [Sub-task, Resolved, Unassigned]
      21. add s3guard CLI command to generate session keys for an assumed role [Sub-task, Resolved, Steve Loughran]
      22. S3AFileSystem.verifyBucketExists to move to s3.doesBucketExistV2 [Sub-task, Resolved, lqjacklee]
      23. FileSystemMultipartUploader should verify that UploadHandle has non-0 length [Sub-task, Resolved, Ewan Higgs]
      24. Memory leak in S3AOutputStream [Sub-task, Resolved, Steve Loughran]
      25. [s3a] stop treat fs.s3a.max.threads as the long-term minimum [Sub-task, Resolved, Sean Mackrory]
      26. S3 listing inconsistency can raise NPE in globber [Sub-task, Resolved, Steve Loughran]
      27. remove obsolete S3A test ITestS3ACredentialsInURL [Sub-task, Resolved, Steve Loughran]
      28. S3A input stream to use etags/version number to detect changed source files [Sub-task, Resolved, Ben Roling]
      29. Move ITestS3AMiniYarnCluster to S3A committers [Sub-task, Resolved, Steve Loughran]
      30. @Retries annotation of putObject() call & uses wrong [Sub-task, Resolved, Steve Loughran]
      31. Review + update cloud store sensitive keys in hadoop.security.sensitive-config-keys [Sub-task, Resolved, Steve Loughran]
      32. ITestS3AContractMultipartUploader#testMultipartUploadEmptyPart test error [Sub-task, Resolved, Ewan Higgs]
      33. Some S3A committer tests don't match ITest* pattern; don't run in maven [Sub-task, Resolved, Steve Loughran]
      34. get patch for S3a nextReadPos(), through Yetus [Sub-task, Resolved, lqjacklee]
      35. Hadoop aws does not use shaded jars [Sub-task, Resolved, Unassigned]
      36. Oozie unable to create sharelib in s3a filesystem [Sub-task, Resolved, Steve Loughran]
      37. S3A committers: make sure there's regular progress() calls [Sub-task, Resolved, lqjacklee]
      38. S3AFileSystem.verifyBucketExists to move to s3.doesBucketExistV2 [Sub-task, Resolved, lqjacklee]
      39. Add bouncycastle jars to hadoop-aws as test dependencies [Sub-task, Resolved, Steve Loughran]
      40. [DOC] Effective use of FS instances during S3A integration tests [Sub-task, Resolved, Gabor Bota]
      41. hamcrest-library declaration in hadoop-aws to be scoped test [Sub-task, Resolved, Steve Loughran]
      42. S3A SSL connections should use OpenSSL [Sub-task, Resolved, Sahil Takiar]
      43. S3A tests to include Terasort [Sub-task, Resolved, Steve Loughran]
      44. Token.toString faulting if any token listed can't load. [Sub-task, Resolved, Steve Loughran]
      45. Move DurationInfo from hadoop-aws to hadoop-common org.apache.hadoop.util [Sub-task, Resolved, Abhishek Modi]
      46. Parquet reading S3AFileSystem causes EOF [Sub-task, Resolved, Steve Loughran]
      47. Update AWS SDK to 1.11.563 [Sub-task, Resolved, Steve Loughran; original estimate 24h, remaining estimate 24h]
      48. Extend documentation in testing.md about endpoint constants [Sub-task, Resolved, Adam Antal]
      49. regression: ITestS3AMiniYarnCluster failing on branch-3.2 [Sub-task, Resolved, Unassigned]
      50. S3Guard to add DynamoDBLocal Support [Sub-task, Resolved, lqjacklee]
      51. S3A copyFile operation to include source versionID or etag in the copy request [Sub-task, Resolved, Steve Loughran]
      52. S3A MarshalledCredentials.toString() doesn't print full date/time of expiry [Sub-task, Resolved, Steve Loughran]
      53. S3AUtils.translateException to map CredentialInitializationException to AccessDeniedException [Sub-task, Resolved, Steve Loughran]
      54. S3AFileSystem#innerMkdirs builds needless lists [Sub-task, Resolved, Lokesh Jain]
      55. warning about user:pass in URI to explicitly call out Hadoop 3.2 as removal [Sub-task, Resolved, Steve Loughran]
      56. Improved S3A MR tests [Sub-task, Resolved, Steve Loughran]
      57. S3AFileStatus to declare that isEncrypted() is always true [Sub-task, Resolved, Steve Loughran]
      58. S3A delegation tests fail if you set fs.s3a.secret.key [Sub-task, Resolved, Unassigned]
      59. S3A Etag tests fail with default encryption enabled on bucket [Sub-task, Resolved, Ben Roling]
      60. S3A Delegation Token code to spell "Marshalled" as Marshaled [Sub-task, Resolved, Steve Loughran]
      61. s3a test docs to mention non-auth; or s3a tests to default to non-auth [Sub-task, Resolved, Unassigned]
      62. ClassCastException in S3GuardTool.checkMetadataStoreUri [Sub-task, Resolved, Steve Loughran]
      63. Regression: TestStagingPartitionedJobCommit failing with empty etag list [Sub-task, Resolved, Steve Loughran]
      64. Remove S3A's depedency on http core [Sub-task, Resolved, Steve Loughran]
      65. Test Hang in S3A S3guard test MetadataStoreTestBase.testListChildren [Sub-task, Resolved, Unassigned]
      66. Stabilize S3A OpenSSL support [Sub-task, Resolved, Sahil Takiar]
      67. MapReduce job tasks fails on S3A ssl3_get_server_certificate:certificate verify [Sub-task, Resolved, Steve Loughran]
      68. TeraSort Job failing on S3 DirectoryStagingCommitter: destination path exists [Sub-task, Resolved, Steve Loughran]
      69. S3A NullPointerException: null uri host. This can be caused by unencoded / in the password string [Sub-task, Resolved, Unassigned]
      70. Option to disable GCM for SSL connections when running on Java 8 [Sub-task, Resolved, Sahil Takiar]
      71. S3AInputStream#unbuffer should merge input stream stats into fs-wide stats [Sub-task, Resolved, Sahil Takiar]
      72. S3A returns 400 "bad request" on a single path within an S3 bucket [Sub-task, Resolved, Unassigned]
      73. AbstractITCommitMRJob.testMRJob test failures [Sub-task, Resolved, Unassigned]
      74. Downgrade INFO message on rm s3a root dir to DEBUG [Sub-task, Resolved, Unassigned]
      75. ITestS3ACommitterFactory failing, S3 client is not inconsistent [Sub-task, Resolved, Steve Loughran]
      76. Typo in s3a committers.md doc [Sub-task, Resolved, Unassigned]
      77. Make last AWS credential provider in default auth chain EC2ContainerCredentialsProviderWrapper [Sub-task, Resolved, Steve Loughran]
      78. Restore (documented) fs.s3a.SharedInstanceProfileCredentialsProvider [Sub-task, Resolved, Steve Loughran]
      79. S3A delegation token tests fail if fs.s3a.encryption.key set [Sub-task, Resolved, Steve Loughran]
      80. S3Guard bucket-info fails if the bucket location is denied to the caller [Sub-task, Resolved, Steve Loughran]
      81. S3A retry policy to be exponential [Sub-task, Resolved, Steve Loughran]
      82. S3ADelegationTokens to only log at debug on startup [Sub-task, Resolved, Steve Loughran]
      83. S3A committers leak threads/raises OOM on job/task commit at scale [Sub-task, Resolved, Steve Loughran]
      84. s3a attempts to look up password/encryption fail if JCEKS file unreadable [Sub-task, Resolved, Unassigned]
      85. S3A ITestRestrictedReadAccess fails [Sub-task, Resolved, Steve Loughran]
      86. Speculating & Partitioned S3A magic committers can leave pending files under __magic [Sub-task, Resolved, Steve Loughran]
      87. S3A ITest*MRjob failures [Sub-task, Resolved, Siddharth Seth]
      88. S3A ITest failures without S3Guard [Sub-task, Resolved, Steve Loughran]
      89. S3A innerGetFileStatus s"directories only" scan still does a HEAD [Sub-task, Resolved, Steve Loughran]
      90. S3A Delegation Token extension point to use StoreContext [Sub-task, Resolved, Steve Loughran]
      91. ITestS3AClosedFS failing -junit test thread [Sub-task, Resolved, Steve Loughran]
      92. S3 getBucketLocation() can return "US" for us-east [Sub-task, Resolved, Steve Loughran]
      93. S3Guard DDB overreacts to no tag access [Sub-task, Resolved, Gabor Bota]
      94. HadoopExecutors cleanup to only log at debug [Sub-task, Resolved, David Mollitor]
      95. S3Guard: Make authoritative mode exclusive for metadata - don't check for expiry for authoritative paths [Sub-task, Resolved, Gabor Bota]
      96. S3A bucket existence checks to support v2 API and "no checks at all" [Sub-task, Resolved, Mukund Thakur]
      97. S3GuardTool to support FilterFileSystem [Sub-task, Resolved, Steve Loughran]
      98. s3guard prune can delete directories -leaving orphan children. [Sub-task, Resolved, Steve Loughran]
      99. S3Guard to support encrypted DynamoDB table [Sub-task, Resolved, Mingliang Liu]
      100. S3A empty dir markers are not created in s3guard as authoritative [Sub-task, Resolved, Steve Loughran]
      101. DurationInfo text parsing/formatting should be moved out of hotpath [Sub-task, Resolved, Rajesh Balamohan]
      102. Increase timeout unit test rule for MetadataStoreTestBase [Sub-task, Resolved, Mingliang Liu]
      103. Refine testing.md to tell user better how to use auth-keys.xml [Sub-task, Resolved, Mingliang Liu]
      104. fs.s3a.authoritative.path should support multiple FS URIs [Sub-task, Resolved, Steve Loughran]
      105. Filesystem openFile() builder to take a FileStatus param [Sub-task, Resolved, Steve Loughran]
      106. S3AInputStream reopening does not handle non IO exceptions properly [Sub-task, Resolved, Sergei Poganshev; original estimate not specified, time spent 1h 10m]
      107. Let s3 clients configure request timeout [Sub-task, Resolved, Mustafa İman]
      108. S3Guard listFiles will not query S3 if all listings are authoritative [Sub-task, Resolved, Mustafa İman]
      109. Large DeleteObject requests are their own Thundering Herd [Sub-task, Resolved, Steve Loughran]
      110. TestHarFileSystem.testInheritedMethodsImplemented broken [Sub-task, Resolved, Steve Loughran]
      111. ITestS3GuardOutOfBandOperations failing on versioned S3 buckets [Sub-task, Resolved, Steve Loughran]
      112. S3A reverts KMS encryption to the bucket's default KMS key in rename/copy [Sub-task, Resolved, Mukund Thakur; original estimate not specified, time spent 1h 10m]

        People

          Assignee: Steve Loughran (stevel@apache.org)
          Reporter: Steve Loughran (stevel@apache.org)
          Votes: 0
          Watchers: 15

            Time Tracking

              Estimated: 24h
              Remaining: 24h
              Logged: 2h 20m
