  1. Hadoop Map/Reduce
  2. MAPREDUCE-1247

Send out-of-band heartbeat to avoid fake lost tasktracker

Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed

    Description

      Currently the TaskTracker reports task status to the JobTracker through heartbeats. Sometimes the TaskTracker holds its own lock to do cleanup work, such as removing task temporary data from disk, and the heartbeat thread hangs for a long time while waiting for that lock. The JobTracker then assumes the TaskTracker is lost and reschedules all of its finished maps or unfinished reduces on other TaskTrackers. We call this a "fake lost tasktracker". It is sometimes unacceptable, especially when running large jobs. So we introduce an out-of-band heartbeat mechanism that sends an out-of-band heartbeat in that case.
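
The idea described above can be illustrated with a small sketch. This is not the patch attached to this issue: every class, field, and method name below is hypothetical, and the lock is modeled with a binary semaphore and `tryAcquire()` so the example never actually hangs (a real heartbeat thread would block at that point). It assumes a watchdog that needs no lock and sends a lightweight out-of-band heartbeat once the tracker has been silent for longer than a configured threshold.

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.Semaphore;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical sketch; none of these names come from the Hadoop code base.
class OutOfBandHeartbeatSketch {
    // Binary semaphore standing in for the TaskTracker-wide lock.
    final Semaphore trackerLock = new Semaphore(1);
    // Timestamp (ms) of the last heartbeat of either kind.
    final AtomicLong lastHeartbeatMillis = new AtomicLong(0);
    // Silence after which the watchdog sends an out-of-band heartbeat.
    // Assumed to be shorter than the JobTracker's expiry interval.
    final long maxSilenceMillis;
    // Record of heartbeats "sent", in order, for inspection.
    final List<String> sent = new CopyOnWriteArrayList<>();

    OutOfBandHeartbeatSketch(long maxSilenceMillis) {
        this.maxSilenceMillis = maxSilenceMillis;
    }

    // Regular heartbeat: needs the lock to collect task statuses.
    // Returns false where a real heartbeat thread would block.
    boolean sendRegularHeartbeat(long nowMillis) {
        if (!trackerLock.tryAcquire()) {
            return false;
        }
        try {
            sent.add("regular");
            lastHeartbeatMillis.set(nowMillis);
            return true;
        } finally {
            trackerLock.release();
        }
    }

    // Watchdog path: takes no lock, carries no task status, and only
    // tells the JobTracker that this tracker is still alive.
    boolean maybeSendOutOfBand(long nowMillis) {
        if (nowMillis - lastHeartbeatMillis.get() < maxSilenceMillis) {
            return false;
        }
        sent.add("out-of-band");
        lastHeartbeatMillis.set(nowMillis);
        return true;
    }

    public static void main(String[] args) {
        OutOfBandHeartbeatSketch tt = new OutOfBandHeartbeatSketch(1000);
        tt.sendRegularHeartbeat(0);
        tt.trackerLock.acquireUninterruptibly(); // cleanup takes the lock
        tt.sendRegularHeartbeat(500);            // would block in reality
        tt.maybeSendOutOfBand(1500);             // watchdog fills the gap
        tt.trackerLock.release();                // cleanup finished
        tt.sendRegularHeartbeat(1600);
        System.out.println(tt.sent);
    }
}
```

The design point is that the liveness signal is decoupled from the lock-protected status report, so a long cleanup delays task status updates but does not make the tracker look lost.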

            People

              buptzhugy ZhuGuanyin
              Votes:
              0
              Watchers:
              11
