Hadoop Map/Reduce
  1. Hadoop Map/Reduce
  2. MAPREDUCE-5718

MR AM should tolerate RM restart/failover during commit

    Details

    • Type: Bug Bug
    • Status: Open
    • Priority: Major Major
    • Resolution: Unresolved
    • Affects Version/s: 2.3.0
    • Fix Version/s: None
    • Component/s: mr-am
    • Labels:
    • Target Version/s:

      Description

      While testing RM HA, we ran into this issue where if the RM fails over while an MR AM is in the middle of a commit, the subsequent AM gets spawned but dies with a diagnostic message - "We crashed durring a commit".

      1. mr-5718-0.patch
        2 kB
        Karthik Kambatla

        Issue Links

          Activity

            People

            • Assignee:
              Karthik Kambatla
              Reporter:
              Karthik Kambatla
            • Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

              • Created:
                Updated:

                Development