  1. Hadoop Common
  2. HADOOP-2051

JobTracker's TaskCommitQueue is vulnerable to non-IOExceptions

Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: 0.15.0
    • Fix Version/s: 0.15.0
    • Component/s: None
    • Labels: None

    Description

      The JobTracker#TaskCommitQueue#run method handles only IOExceptions. Christian Kunz ran into a scenario where a job was stuck with all tasks in the COMMIT_PENDING state, and the stack traces showed that the "Task Commit Thread" was no longer running, presumably because an uncaught non-IOException had terminated it.

      The workaround is to model TaskCommitQueue#run on the other long-running threads in the JobTracker (ExpireLaunchingTasks, ExpireTrackers, etc.), which catch, log, and ignore any Exception inside their loop.
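
      The catch-log-and-continue pattern described above might look roughly like the sketch below. This is not the JobTracker code or the attached patch: the class name ResilientCommitLoop, the Runnable-based queue, and the counters are all hypothetical stand-ins chosen to keep the example self-contained.

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.logging.Level;
import java.util.logging.Logger;

/** Hypothetical stand-in for a commit thread; NOT the actual TaskCommitQueue. */
public class ResilientCommitLoop implements Runnable {
    private static final Logger LOG =
        Logger.getLogger(ResilientCommitLoop.class.getName());

    // Assumption: each pending commit is modelled as a plain Runnable.
    private final BlockingQueue<Runnable> queue = new LinkedBlockingQueue<>();
    private final AtomicInteger committed = new AtomicInteger();
    private final AtomicInteger failed = new AtomicInteger();

    public void enqueue(Runnable commitAction) { queue.add(commitAction); }
    public int committedCount() { return committed.get(); }
    public int failedCount() { return failed.get(); }

    @Override
    public void run() {
        while (!Thread.currentThread().isInterrupted()) {
            try {
                Runnable action = queue.take();   // blocks until work arrives
                action.run();
                committed.incrementAndGet();
            } catch (InterruptedException ie) {
                // Shutdown request: restore the flag so the loop exits.
                Thread.currentThread().interrupt();
            } catch (Exception e) {
                // Catch every Exception, not only IOException, so that one
                // bad commit cannot kill the thread and strand later tasks.
                failed.incrementAndGet();
                LOG.log(Level.WARNING, "Task commit failed; continuing", e);
            }
        }
    }
}
```

      With this structure, a RuntimeException thrown while committing one task is logged and counted, and the loop moves on to the next queued item. Catching Exception rather than Throwable is a design choice in this sketch: an Error such as OutOfMemoryError would still terminate the thread.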

      Attachments

        1. HADOOP-2051_2_20071016.patch
          11 kB
          Arun Murthy
        2. HADOOP-2051_1_20071013.patch
          12 kB
          Arun Murthy

          People

            Assignee: Arun Murthy (acmurthy)
            Reporter: Arun Murthy (acmurthy)