  1. Hadoop Common
  2. HADOOP-1312

heartbeat monitor thread goes away

Details

    • Type: Bug
    • Status: Closed
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.13.0
    • Component/s: None
    • Labels: None

    Description

      The heartbeat monitor thread encounters a ConcurrentModificationException while iterating over the "heartbeats" data structure. This occurred while the namenode was being restarted. There are actually two bugs here:

      1. The heartbeat monitor thread needs to catch exceptions and continue instead of exiting.
      2. The heartbeats data structure is protected by the heartbeats lock. The registerDatanode() method invokes removeDatanode() without acquiring the heartbeats monitor lock. This causes the ConcurrentModificationException.
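
      The two fixes described above can be sketched roughly as follows. This is a minimal illustration, not the attached patch: the class HeartbeatSketch, the List<String> standing in for the heartbeats structure, and the methods heartbeatCheck(), runMonitor() and failedPasses() are all hypothetical. Only the names registerDatanode(), removeDatanode() and the heartbeats lock come from the report.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical, simplified stand-in for the namenode's heartbeat bookkeeping.
public class HeartbeatSketch {
    // Assumption: a plain list of node IDs stands in for the real "heartbeats" structure.
    private final List<String> heartbeats = new ArrayList<>();
    private int failedPasses = 0;

    // Fix 2: take the heartbeats lock before removeDatanode() mutates the list.
    public void registerDatanode(String nodeId) {
        synchronized (heartbeats) {
            removeDatanode(nodeId);   // drop any stale registration first
            heartbeats.add(nodeId);
        }
    }

    private void removeDatanode(String nodeId) {
        // Java intrinsic locks are reentrant, so locking again here is safe
        // and also protects callers that forgot to take the lock.
        synchronized (heartbeats) {
            heartbeats.remove(nodeId);
        }
    }

    // One monitor pass: iterate under the same lock that guards all mutations.
    public int heartbeatCheck() {
        synchronized (heartbeats) {
            int seen = 0;
            for (String nodeId : heartbeats) {
                if (nodeId.isEmpty()) {
                    // Simulated failure, used only to exercise the catch below.
                    throw new IllegalStateException("malformed heartbeat entry");
                }
                seen++;
            }
            return seen;
        }
    }

    // Fix 1: the monitor loop catches exceptions and continues instead of exiting.
    public int runMonitor(int passes) {
        int completed = 0;
        for (int i = 0; i < passes; i++) {
            try {
                heartbeatCheck();
                completed++;
            } catch (Exception e) {
                failedPasses++;       // record the failure and keep monitoring
            }
        }
        return completed;
    }

    public int failedPasses() {
        return failedPasses;
    }
}
```

      In this sketch every reader and writer of the list synchronizes on the same object, so an iteration can no longer observe a concurrent removal, and a failed pass is counted rather than ending the loop.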

      Attachments

        1. heartbeatmonitor-0.12.3.patch
          3 kB
          Dhruba Borthakur
        2. heartbeatmonitor3.patch
          3 kB
          Dhruba Borthakur


            People

              Assignee: Dhruba Borthakur
              Reporter: Dhruba Borthakur