Hadoop Map/Reduce
  1. Hadoop Map/Reduce
  2. MAPREDUCE-1119

When tasks fail to report status, show tasks's stack dump before killing

    Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Major Major
    • Resolution: Fixed
    • Affects Version/s: 0.22.0
    • Fix Version/s: 0.21.0
    • Component/s: tasktracker
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      When the TT kills tasks that haven't reported status, it should somehow gather a stack dump for the task. This could be done either by sending a SIGQUIT (so the dump ends up in stdout) or perhaps something like JDI to gather the stack directly from Java. This may be somewhat tricky since the child may be running as another user (so the SIGQUIT would have to go through LinuxTaskController). This feature would make debugging these kinds of failures much easier, especially if we could somehow get it into the TaskDiagnostic message

      1. MAPREDUCE-1119.2.patch
        13 kB
        Aaron Kimball
      2. MAPREDUCE-1119.3.patch
        26 kB
        Aaron Kimball
      3. MAPREDUCE-1119.4.patch
        27 kB
        Aaron Kimball
      4. MAPREDUCE-1119.5.patch
        26 kB
        Aaron Kimball
      5. MAPREDUCE-1119.6.patch
        31 kB
        Aaron Kimball
      6. MAPREDUCE-1119.patch
        13 kB
        Aaron Kimball

        Issue Links

          Activity

            People

            • Assignee:
              Aaron Kimball
              Reporter:
              Todd Lipcon
            • Votes:
              0 Vote for this issue
              Watchers:
              14 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development