Details
-
Bug
-
Status: Closed
-
Blocker
-
Resolution: Fixed
-
None
-
None
-
None
Description
The heartbeat monitor thread encounters a ConcurrentModificationException while iterating over the "heartbeats" data structure. This occurs when the namenode was getting restarted. There are actuallt two bugs here:
1. The Heartbeat Monitor thread needs to catch Exceptions and continue, instead of exiting.
2. The heartbeats data structures is protected by the heartbeats lock. The registerDatanode() method invokes removeDatanode() without acquiring the heartbeats monitor lock. This causes the ConcurrentModificationException.
Attachments
Attachments
Issue Links
- is related to
-
HADOOP-1255 Name-node falls into infinite loop trying to remove a dead node.
- Closed