Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.23.3, 2.0.2-alpha
    • Fix Version/s: 3.0.0, 2.0.3-alpha, 0.23.5
    • Component/s: None
    • Labels:
      None

      Description

      We found some jobs stuck in KILL_WAIT for days on end. The RM shows them as RUNNING. The AM shows each job in the KILL_WAIT state with a few maps still running. All of these maps were scheduled on nodes that are now in the RM's Lost nodes list. The running maps are in the FAIL_CONTAINER_CLEANUP state.

      1. MAPREDUCE-4751-20121108.txt
        12 kB
        Vinod Kumar Vavilapalli
      2. MAPREDUCE-4751-20121109.txt
        22 kB
        Vinod Kumar Vavilapalli
      3. MR-4751-branch-0.23.txt
        22 kB
        Robert Joseph Evans
      4. TaskAttemptStateGraph.jpg
        417 kB
        Ravi Prakash

            People

            • Assignee:
              Vinod Kumar Vavilapalli
              Reporter:
              Ravi Prakash
            • Votes:
              0
              Watchers:
              9
