Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
As per discussion in HDFS-9917 and comment from vinayrpet
Avoid accumulation of IBRs when the standby is down for long time, can we consider as below.
1. IBRs for StandbyNN can have a threshold ( say 100K or 1Million IBRs ).
2. Also not to loose any important IBRs, IBRs can be cleared when "the threshold is reached AND 'lastIBR' is more than 'heartbeatExpiryInterval'. i.e. DataNode is considered dead in Namenode side". In that case, for sure re-Register() will be called on reconnection to running NameNode (if any).