Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
2.2.0
-
None
Description
On a large cluster with heavy load on NameNode, sometimes there happens to be greater latency for Standby NameNode in responding back its JMX metrics which results in timeout for the JMX call dome by ambari-server and no JMX data for standby NameNode is received sometimes.
Thus Standby NameNode in such environment responses sometimes by declaring itself standby and sometimes doesn't respond quick enough before timeout happens.
Due to this behavior ambari-web keeps alternating frequently between “standby namenode” and “namenode”, even though there is no failover.
As a fix ambari if detects any one namenode to be active should consider other to be standby even though JMX call to standby namenode has timedout.
Attachments
Attachments
Issue Links
- links to