Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-2051

JobTracker's TaskCommitQueue is vulnerable to non-IOExceptions

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 0.15.0
    • Fix Version/s: 0.15.0
    • Component/s: None
    • Labels:
      None

      Description

      The JobTracker#TaskCommitQueue#run method only handles IOException}}s. Christian Kunz ran into a scenario where a job was stuck with all tasks in {{COMMIT_PENDING state and the stack traces showed that the "Task Commit Thread" wasn't even around.

      The work-around is to model TaskCommitQueue#run along the lines of other long-running threads in the JobTracer (ExpireLaunchingTasks, ExpireTrackers etc.) to catch, log and ignore any Exception in a loop.

        Attachments

        1. HADOOP-2051_2_20071016.patch
          11 kB
          Arun Murthy
        2. HADOOP-2051_1_20071013.patch
          12 kB
          Arun Murthy

          Activity

            People

            • Assignee:
              acmurthy Arun Murthy
              Reporter:
              acmurthy Arun Murthy
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: