VotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Reviewed

    Description

      When YARN reports a completed container to the MR AM, it always interprets it as a failure. This can lead to a job failing because too many of its tasks failed, when in fact they only failed because the scheduler preempted them.

      MR needs to recognize the special exit code value of -100 and interpret it as a container being killed instead of a container failure.

      Attachments

        1. MAPREDUCE-4951.patch
          3 kB
          Sandy Ryza
        2. MAPREDUCE-4951-1.patch
          6 kB
          Sandy Ryza
        3. MAPREDUCE-4951-2.patch
          6 kB
          Sandy Ryza

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            sandyr Sandy Ryza
            sandyr Sandy Ryza
            Votes:
            1 Vote for this issue
            Watchers:
            12 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment