Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-15620 Über-jira: S3A phase VI: Hadoop 3.3 features
  3. HADOOP-15961

S3A committers: make sure there's regular progress() calls

VotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • None
    • 3.3.0
    • fs/s3
    • None

    Description

      MAPREDUCE-7164 highlights how inside job/task commit more context.progress() callbacks are needed, just for HDFS.

      the S3A committers should be reviewed similarly.

      At a glance:

      StagingCommitter.commitTaskInternal() is at risk if a task write upload enough data to the localfs that the upload takes longer than the timeout.

      it should call progress it every single file commits, or better: modify uploadFileToPendingCommit to take a Progressable for progress callbacks after every part upload.

      Attachments

        1. HADOOP-15961-001.patch
          7 kB
          lqjacklee
        2. HADOOP-15961-002.patch
          11 kB
          lqjacklee
        3. HADOOP-15961-003.patch
          9 kB
          lqjacklee

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Jack-Lee lqjacklee
            stevel@apache.org Steve Loughran
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment