Details
-
Bug
-
Status: Patch Available
-
Major
-
Resolution: Unresolved
-
None
-
None
Description
In a cluster with more than one Standby namenode, manual failover throws exception for some cases
When trying to exectue the failover command from active to standby
./hdfs haadmin -failover nn1 nn2, below Exception is thrown
Operation failed: Call From X-X-X-X/X-X-X-X to Y-Y-Y-Y:nnnn failed on connection exception: java.net.ConnectException: Connection refused
This is encountered in the following cases :
Scenario 1 :
Namenodes - NN1(Active) , NN2(Standby), NN3(Standby)
When trying to manually failover from NN1 to NN2 if NN3 is down, Exception is thrown
Scenario 2 :
Namenodes - NN1(Active) , NN2(Standby), NN3(Standby)
ZKFC's - ZKFC1, ZKFC2, ZKFC3
When trying to manually failover using NN1 to NN3 if NN3's ZKFC (ZKFC3) is down, Exception is thrown