Hadoop Common / HADOOP-137

Different TaskTrackers may get the same task tracker id, causing many problems.

Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 0.2.0
    • Fix Version/s: 0.2.0
    • Component/s: None
    • Labels: None

    Description

      In the TaskTracker#Initialize method, the following line assigns the task tracker name (id):

      this.taskTrackerName = "tracker_" + (Math.abs(r.nextInt()) % 100000);

      For a fair-sized cluster, it is possible for different task trackers to get the same id, causing a name conflict.
      I encountered this problem with a cluster of 274 nodes. Once such a conflict happens, a lot of strange things may happen.
      For example, a reducer task tried to copy a map output file from one machine (task tracker) when the file had actually
      been produced on another machine.
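
      To give a rough sense of how likely such a conflict is, the sketch below computes the chance that at least
      two trackers draw the same id. It assumes each tracker picks its id independently and uniformly from the
      100000 possible values (an approximation of the expression above). The class and method names are
      hypothetical and are not part of Hadoop.

```java
public class TrackerIdCollision {

    /**
     * Probability that at least two of `trackers` task trackers end up with the
     * same id, assuming each id is drawn independently and uniformly from
     * `idSpace` possible values (an assumption, not something measured on a cluster).
     */
    static double collisionProbability(int trackers, int idSpace) {
        if (trackers > idSpace) {
            return 1.0; // more trackers than ids: a duplicate is unavoidable
        }
        double noCollision = 1.0;
        for (int k = 0; k < trackers; k++) {
            // The (k+1)-th tracker must avoid the k ids already taken.
            noCollision *= (double) (idSpace - k) / idSpace;
        }
        return 1.0 - noCollision;
    }

    public static void main(String[] args) {
        int idSpace = 100000; // modulus used when building the tracker name
        int trackers = 274;   // cluster size from the report
        System.out.printf("P(collision) for %d trackers: %.4f%n",
                trackers, collisionProbability(trackers, idSpace));
    }
}
```

      Under these assumptions, a 274-node cluster has a bit over a 30% chance of at least one duplicate id each
      time all trackers start, so the conflict is not a rare corner case.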

            People

              Assignee: Owen O'Malley (omalley)
              Reporter: Runping Qi (runping)