Details

    • Reviewed

    Description

      When YARN reports a completed container to the MR AM, it always interprets it as a failure. This can lead to a job failing because too many of its tasks failed, when in fact they only failed because the scheduler preempted them.

      MR needs to recognize the special exit code value of -100 and interpret it as a container being killed instead of a container failure.

      Attachments

        1. MAPREDUCE-4951-2.patch
          6 kB
          Sandy Ryza
        2. MAPREDUCE-4951-1.patch
          6 kB
          Sandy Ryza
        3. MAPREDUCE-4951.patch
          3 kB
          Sandy Ryza

        Issue Links

          Activity

            People

              sandyr Sandy Ryza
              sandyr Sandy Ryza
              Votes:
              1 Vote for this issue
              Watchers:
              12 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: