Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-18477

Über-jira: S3A Hadoop 3.3.9 features

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.3.5
    • None
    • fs/s3
    • None

    Description

      Changes related to s3a in the next branch-3.3 release.
      Presence in this list != any commitment to implement, unless there's active dev

      Attachments

        Issue Links

          1.
          ITestCustomSigner uses absolute paths off the bucket root rather than fork-relative Sub-task Open Unassigned  
          2.
          S3A input stream to support ByteBufferReadable Sub-task Open Unassigned  
          3.
          Tune hadoop-aws parallel test surefire/failsafe settings Sub-task Open Unassigned  
          4.
          Impersonate hosts in s3a for better data locality handling Sub-task Open Thomas Demoor  
          5.
          ITestS3AContractRootDir failure on non-S3Guarded bucket Sub-task Resolved Unassigned  
          6.
          Document `dynamodb:TagResource` an explicit client-side permission for S3Guard Sub-task Resolved Gabor Bota  
          7.
          Public dataset class for S3A integration tests Sub-task Open Daniel Carl Jones  
          8.
          Some S3A tests leak filesystem instances Sub-task Open Unassigned

          100%

          Original Estimate - Not Specified Original Estimate - Not Specified
          Time Spent - 0.5h
          9.
          s3 and abfs incremental listing: use SAX parsers to stream results to list iterators Sub-task Open Unassigned  
          10.
          builld up md5 checksum as blocks are built in S3ABlockOutputStream; validate upload Sub-task Open Unassigned  
          11.
          Use S3 content-range header to update length of an object during reads Sub-task Open Monthon Klongklaew

          100%

          Original Estimate - Not Specified Original Estimate - Not Specified
          Time Spent - 50m
          12.
          S3a operations keep retrying if the password is wrong Sub-task Open Thomas Poepping  
          13.
          S3A: Set thread names with more specific information about the call. Sub-task Open Unassigned  
          14.
          NPE in S3AInputStream.read() in ITestS3AInconsistency.testOpenFailOnRead Sub-task Open Unassigned  
          15.
          ITestS3ARemoteFileChanged doesn't overwrite test data creation Sub-task Open Unassigned  
          16.
          test and document use of fs.s3a.signing-algorithm Sub-task Open Unassigned  
          17.
          s3a listing IOStatistics to count #of entries returned per LIST call Sub-task Open Unassigned  
          18.
          S3A delegation token binding to support secondary binding list Sub-task Resolved Steve Loughran  
          19.
          S3A DT marshalling to include nested error text in wrapped message Sub-task Open Unassigned  
          20.
          increase the default number of threads and http connections in S3A Sub-task Open Unassigned  
          21.
          clean up ITestS3AFileSystemContract Sub-task Patch Available Unassigned  
          22.
          S3A to implement rename(final Path src, final Path dst, final Rename... options) Sub-task Open Unassigned  
          23.
          ITestS3AInputStreamPerformance#testDecompressionSequential128K NPE if no CSV file available Sub-task Open Unassigned  
          24.
          hadoop-cloud-storage transient dependencies need review Sub-task Resolved Steve Loughran  
          25.
          Possible inconsistent state of AbstractDelegationTokenSecretManager Sub-task Patch Available Hankó Gergely

          100%

          Original Estimate - Not Specified Original Estimate - Not Specified
          Time Spent - 1h 10m
          26.
          S3A DT support to warn when loading expired token Sub-task Open Steve Loughran  
          27.
          s3a rm on the CLI generates deprecation warning on io.bytes.per.checksum Sub-task Open Unassigned  
          28.
          Clarify committers.md around v2 failure handling Sub-task Open Unassigned  
          29.
          Handle S3A "glacier" data Sub-task Open Bhavay Pahuja  
          30.
          Test MR split optimisation with recursive listing Sub-task Open Unassigned  
          31.
          s3a to improve diags on s3a bad request message Sub-task Open Unassigned  
          32.
          Support AWS IAM Identity Centre (prev. AWS SSO) for providing credentials to S3A Sub-task Open Unassigned  
          33.
          NPE in s3a byte buffer block upload Sub-task Resolved Unassigned  
          34.
          S3AInputStream logging to make it easier to debug file leakage Sub-task Open Unassigned  
          35.
          ITestS3AConfiguration.testProxyConnection failing when s3a bucket probe disabled Sub-task Open Unassigned  
          36.
          support git-secrets commit hook to keep AWS secrets out of git Sub-task Patch Available Steve Loughran  
          37.
          S3A doesn't calculate Content-MD5 on uploads Sub-task Open Unassigned  
          38.
          cherry pick s3 ehancements from PrestoS3FileSystem Sub-task Open Unassigned  
          39.
          S3A deleteObjects hanging/retrying forever Sub-task Open Unassigned  
          40.
          Understand status of S3 access point alias support in S3A Sub-task Open Unassigned  
          41.
          Add custom InstanceProfileCredentialsProvider with more resilience to throttling Sub-task Open Unassigned  
          42.
          S3A connector to improve support for all AWS partitions Sub-task Open Unassigned  
          43.
          GCS to support per-bucket configuration Sub-task Open Unassigned  
          44.
          S3aDelegationTokens to add accessor for tests to get at the token binding Sub-task Open Unassigned  
          45.
          ITestDynamoDBMetadataStore.testTableVersioning failure -DDB deleteItem consistency? Sub-task Resolved Unassigned  
          46.
          S3 Select Exceptions are not being converted to IOEs Sub-task Open Unassigned  
          47.
          compatibility table in directory_markers.md doesn't render right Sub-task Resolved Masatake Iwasaki  
          48.
          s3a new getdefaultblocksize be called in getFileStatus which has not been implemented in s3afilesystem yet Sub-task Open Unassigned  
          49.
          S3A Xattr/getXAttr to handle directories without markers Sub-task Open Unassigned  
          50.
          Support multipart download in S3AFileSystem Sub-task Open Unassigned  
          51.
          review S3A translateException translation matches IBM CORS spec Sub-task Open Unassigned  
          52.
          add a special 0 byte input stream for empty blobs Sub-task Open Unassigned  
          53.
          Remove fs.s3a.executor.capacity Sub-task Open Viraj Jasani  
          54.
          Optimise S3A’s recursive delete to drop successful S3 keys on retry of S3 DeleteObjects Sub-task Open Unassigned  
          55.
          S3A (async) ObjectListingIterator to block in hasNext() for results Sub-task Open Steve Loughran

          100%

          Original Estimate - Not Specified Original Estimate - Not Specified
          Time Spent - 1h 40m
          56.
          S3A: Allow SSE configurations per object path Sub-task Open Unassigned  
          57.
          strip s3.amazonaws.com off hostnames before making s3a calls Sub-task Open Unassigned  
          58.
          Filesystem discovery to stop loading implementation classes Sub-task Open Unassigned  
          59.
          shell rm command to not rename to ~/.Trash in object stores Sub-task Open Unassigned  
          60.
          Use lighter-weight alternatives to innerGetFileStatus where possible Sub-task Open Unassigned  
          61.
          Speed up S3A test runs Sub-task Open Unassigned  
          62.
          FileSystem/s3a processDeleteOnExit to skip the exists() check Sub-task Open Unassigned  
          63.
          Remove transient dependency on hadoop-hdfs-client Sub-task Open Unassigned  
          64.
          s3guard bucket-info command to add a verify-property <key>=<value> <bucket> Sub-task Open Unassigned  
          65.
          S3A Secret access to fall back to XML if credential provider raises IOE. Sub-task Open Unassigned  
          66.
          ITestS3A select tests fail if user kinited in Sub-task Open Unassigned  
          67.
          ITestS3AContractSeek teardown closes test FS before superclass can do its cleanup Sub-task Open Unassigned  
          68.
          Add s3a tool to convert S3 server logs to avro/csv files Sub-task Open Mehakmeet Singh  
          69.
          abfs & s3a FS instantiate triggers warning about deprecated io.bytes.per.checksum Sub-task Resolved Unassigned  
          70.
          remove filtering of directory markers in s3a RenameOperation Sub-task Open Unassigned  
          71.
          S3AFilesystem trash handling should respect the current UGI Sub-task Open Unassigned  
          72.
          S3A can support short user-friendly aliases for configuration of credential providers. Sub-task Open Unassigned  
          73.
          S3AFileStatus to add a serialVersionUID; review & test serialization Sub-task Open Unassigned  
          74.
          Add common getFileBlockLocations() emulation for object stores, including S3A Sub-task Patch Available Steve Loughran  
          75.
          S3A Filesystem does not check return from AmazonS3Client deleteObjects Sub-task Open Unassigned  
          76.
          S3A client retries on SSL Auth exceptions triggered by "." bucket names Sub-task Open Unassigned  
          77.
          s3guard uploads command to list date and initiator of outstanding uploads Sub-task Open Unassigned  
          78.
          Address JavaDoc warnings in classes like MarkerTool, S3ObjectAttributes, etc. Sub-task Resolved Ankit Saurabh  
          79.
          S3ARetryPolicy to handle AWS 500 responses/error code TooBusyException with the throttle backoff policy Sub-task Open Unassigned  
          80.
          S3AInputStream.skip() to use lazy seek Sub-task Open Ahmar Suhail

          100%

          Original Estimate - Not Specified Original Estimate - Not Specified
          Time Spent - 4h 20m
          81.
          s3guard bucket-info command to include default bucket encryption info Sub-task Open Unassigned  
          82.
          S3A DeleteOperation to parallelize POSTing of bulk deletes Sub-task Open Unassigned  
          83.
          ITestCustomSigner fails with NPE against private endpoint Sub-task Resolved Unassigned  
          84.
          S3a DelegationToken bindings to to support a "correlation ID" for the UA header Sub-task Open Unassigned  
          85.
          Review S3A documentation to make sure it is consistent with the current codebase Sub-task Open Unassigned  
          86.
          S3AInputStream.remainingInFile should use nextReadPos Sub-task Reopened lqjacklee  
          87.
          Add S3AWriteOpContext for write ops; pass in statistics and other settings Sub-task Open Unassigned  
          88.
          log accepted/rejected fs.s3a.authoritative.path paths @ debug Sub-task Open Unassigned  
          89.
          typo in TestNeworkBinding Sub-task Resolved Steve Loughran  
          90.
          S3A to support configuring various AWS S3 client extended options Sub-task Open Unassigned  
          91.
          Add AWS S3 Transfer acceleration support Sub-task Open Unassigned  
          92.
          Add a way for an FS instance to say "really, no trash interval at all" Sub-task Open Unassigned  
          93.
          AWS AssumedRoleCredentialProvider needs ExternalId add Sub-task Open Unassigned  
          94.
          Report problems w/ local S3A buffer directory meaningfully Sub-task Open Unassigned  
          95.
          ITestS3AMiniYarnCluster fails on sequential runs with Kerberos error Sub-task Open Unassigned  
          96.
          test YARN log collection works to s3a Sub-task Open Unassigned  
          97.
          Encrypt S3A buffered data on disk Sub-task Open Unassigned  
          98.
          S3AInputStream read(bytes[]) to not retry on read failure: pass action up Sub-task Open Ahmar Suhail

          100%

          Original Estimate - Not Specified Original Estimate - Not Specified
          Time Spent - 40m
          99.
          make s3a read fault injection configurable including "off" Sub-task Open Unassigned  
          100.
          S3a auth exception to link to a wiki page on the problem Sub-task Open Unassigned  
          101.
          Improve S3A delegations token documentation Sub-task Resolved Ahmar Suhail

          100%

          Original Estimate - Not Specified Original Estimate - Not Specified
          Time Spent - 20m
          102.
          Test hadoop fs shell against s3a; fix problems Sub-task Open Unassigned

          100%

          Original Estimate - Not Specified Original Estimate - Not Specified
          Time Spent - 2h
          103.
          AbstractContractDistCpTest to test attr preservation with -p, verify blobstores downgrade Sub-task Open Steve Loughran  
          104.
          S3AInputStream.seek should throw EOFException if seeking past the end of file Sub-task Open Unassigned  
          105.
          multipart/huge file upload tests to look at checksums returned Sub-task Open Unassigned  
          106.
          TestS3AGetFileStatus:testNotFound() to use intercept() Sub-task Open Unassigned  
          107.
          ITestS3AAWSCredentialsProvider tests fail if a bucket has DTs enabled Sub-task Open Unassigned  
          108.
          ITestS3APrefetchingInputStream does not skip if no CSV test file available Sub-task Resolved Daniel Carl Jones  
          109.
          hadoop-aws tests to take a configurable subdir in the test bucket Sub-task Open Unassigned  
          110.
          support multiple s3a integration test runs on same bucket in parallel Sub-task In Progress Steve Loughran  
          111.
          S3A: add option to disable probe for dir marker recreation on delete/rename. Sub-task Open Harshit Gupta  
          112.
          New configuration for static headers to be added to all S3 requests Sub-task Open Unassigned  
          113.
          Warn when no region is configured Sub-task Open Unassigned  
          114.
          S3A AssumedRole credentials provider should use Instance Role credentials in chain for assuming role Sub-task Open Unassigned  
          115.
          define s3a encryption behaviour on copy Sub-task Open Unassigned  
          116.
          S3A openFile() options to allow etag/version to be set Sub-task Reopened Unassigned  
          117.
          improve s3a committer stats collected Sub-task Open Unassigned  
          118.
          Add "versions" tool to s3a command line entry point Sub-task Open Unassigned  
          119.
          Add API for bulk/paged object deletion Sub-task Open Unassigned  
          120.
          Add new store vendor config option Sub-task Open Unassigned  
          121.
          Amazon S3 disabling ACLs on all new buckets Sub-task Open Unassigned  
          122.
          S3A audit header to include count of items in delete ops Sub-task Resolved Viraj Jasani  
          123.
          S3A ITestS3ABucketExistence access point test failure Sub-task Resolved Steve Loughran  
          124.
          S3ABlockOutputStream doesn't collect stats on multipart block uploads Sub-task Resolved Steve Loughran  
          125.
          S3A Authentication to support WebIdentity Sub-task Open Unassigned

          100%

          Original Estimate - Not Specified Original Estimate - Not Specified
          Time Spent - 3h 20m
          126.
          ITestS3ABlockOutputArray failure with IO File name too long Sub-task Open Unassigned  
          127.
          Change fs.s3a.directory.marker.retention to "keep" Sub-task Resolved Steve Loughran  
          128.
          Upgrade aws-java-sdk to 1.12.367+ Sub-task Resolved Viraj Jasani  
          129.
          Test failures with CSE enabled Sub-task Open Ahmar Suhail  
          130.
          ITestS3AFileSystemStatistic failure on mvn verify Sub-task Open Unassigned  
          131.
          Make it possible to disable ITestS3ATemporaryCredentials Sub-task Resolved Unassigned  
          132.
          ITestS3ACopyFromLocalFile: AuditFailureException Sub-task Open Unassigned  
          133.
          Upgrade aws-java-sdk to 1.12.499+ Sub-task Resolved Viraj Jasani  
          134.
          Add a way to get the IOStatistics of active filesystems in long-lived processes Sub-task Open Unassigned  
          135.
          Add ability to configure ConnectionTTL of http connections while creating S3 Client. Sub-task Resolved Mukund Thakur  
          136.
          s3a client SSLException is raised after very long timeout "Unsupported or unrecognized SSL message" Sub-task Open Unassigned  
          137.
          Support Overwrite Directory On Commit For S3A Committers Sub-task Open Syed Shameerur Rahman  
          138.
          remove/deprecate fs.s3a.multipart.purge Sub-task Open Unassigned  
          139.
          ITestS3GuardTool fails if SSE/DSSE encryption is used Sub-task Resolved Viraj Jasani  
          140.
          S3A: Upgrade AWS SDK version to 2.21.33 for Amazon S3 Express One Zone support Sub-task Resolved Ahmar Suhail  
          141.
          fs.getXattrs(path) for S3FS doesn't have x-amz-server-side-encryption-aws-kms-key-id header. Sub-task Open Mukund Thakur  
          142.
          S3A: ITestS3AFileContextURI: MultiObjectDeleteException bulk delete of odd filenames Sub-task Open Unassigned  
          143.
          Assist reflection IO with WrappedOperations class Sub-task Open Steve Loughran  

          Activity

            People

              Unassigned Unassigned
              stevel@apache.org Steve Loughran
              Votes:
              1 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 14h 50m
                  14h 50m