Details
-
Bug
-
Status: Closed
-
Blocker
-
Resolution: Duplicate
-
0.1.1
-
None
-
None
Description
some hours after adding some new nodes to the cluster, the name node went into a state where it's consuming 100% cpu.
The log file keeps logging messages of the forms
060421 155049 Obsoleting block blk_8093115169359854355
060421 155049 Pending transfer (block blk_-6965677235456960523) from node1383:50010 to 2 destinations
060421 155049 Block report from node1283:50010: 2140 blocks.
060421 155049 Redundant addStoredBlock request received for block blk_-6836937139917042917 on node node1143:50010
many DFS operations time out, making useful work impossible.
restarting dfs solved the problem for a while, but it came back within an hour.