Details
-
Improvement
-
Status: Resolved
-
Blocker
-
Resolution: Duplicate
-
0.20.2
-
None
-
None
Description
I've heard of several Hadoop users using dfsadmin -report to monitor the number of dead nodes, and alert if that number is not 0. This mechanism tends to work pretty well, except when a node is decommissioned or fails, because then the namenode requires a restart for said node to be entirely removed from HDFS. More details here:
Removal from the exclude file and a refresh should get rid of the dead node.