  1. Hadoop Map/Reduce
  2. MAPREDUCE-7053

Timed out tasks can fail to produce thread dump

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.1.0, 3.0.1, 2.10.0, 2.9.1, 2.8.4
    • Fix Version/s: 3.1.0, 2.10.0, 2.9.1, 2.8.4, 3.0.3
    • Component/s: None
    • Labels: None

    Description

      TestMRJobs#testThreadDumpOnTaskTimeout has recently been failing sporadically. When the AM (ApplicationMaster) times out a task, it immediately removes the task from the list of known tasks and then connects to the NM (NodeManager) to request a thread dump followed by a kill. If the task heartbeats in after it has been removed from the list of known tasks but before the thread dump signal arrives, the task can exit with an "org.apache.hadoop.mapred.Task: Parent died." message and no thread dump.
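
      The interleaving described above can be illustrated with a small, deterministic toy model. This is only a sketch of the race as the description states it, not code from Hadoop or from the attached patches: the class TimeoutRaceSketch, its methods, and the attempt id "attempt_0" are all hypothetical, and the real AM, NM and task communicate over RPC and signals rather than direct method calls.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Toy model of the race. None of these names are real Hadoop classes. */
public class TimeoutRaceSketch {
    private final Set<String> knownTasks = new HashSet<>();
    private final List<String> log = new ArrayList<>();
    private boolean taskAlive = true;

    TimeoutRaceSketch(String taskId) {
        knownTasks.add(taskId);
    }

    /** AM side, step 1: the timed-out task is forgotten immediately. */
    void removeTimedOutTask(String taskId) {
        knownTasks.remove(taskId);
        log.add("AM: removed " + taskId);
    }

    /** AM side, step 2: the thread dump request finally reaches the task. */
    void deliverThreadDumpSignal() {
        log.add(taskAlive
                ? "task: thread dump written"
                : "task: already exited, no thread dump");
    }

    /** Task side: a heartbeat that the AM no longer recognizes makes the task exit. */
    void heartbeat(String taskId) {
        if (taskAlive && !knownTasks.contains(taskId)) {
            taskAlive = false;
            log.add("task: Parent died.");
        }
    }

    /** Runs one interleaving; the flag places a heartbeat inside the race window. */
    static List<String> simulate(boolean heartbeatInsideWindow) {
        TimeoutRaceSketch s = new TimeoutRaceSketch("attempt_0");
        s.removeTimedOutTask("attempt_0");
        if (heartbeatInsideWindow) {
            s.heartbeat("attempt_0"); // arrives after removal, before the signal
        }
        s.deliverThreadDumpSignal();
        return s.log;
    }

    public static void main(String[] args) {
        System.out.println(simulate(false));
        System.out.println(simulate(true));
    }
}
```

      In this model, simulate(false) ends with the thread dump being written, while simulate(true) ends with the task exiting on "Parent died." before the signal is delivered, which corresponds to the sporadic test failure.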

      Attachments

        1. MAPREDUCE-7053.001.patch
          14 kB
          Jason Darrell Lowe
        2. MAPREDUCE-7053-branch-2.001.patch
          14 kB
          Jason Darrell Lowe

            People

              Assignee: Jason Darrell Lowe (jlowe)
              Reporter: Jason Darrell Lowe (jlowe)
              Votes: 0
              Watchers: 5
