Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
2.0.2-alpha, 3.0.0-alpha1
-
None
-
Reviewed
Description
When the EditLogTailer thread calls rollEdits() on the active NN via RPC, it currently does so without a timeout. So, if the active NN has frozen (but not actually crashed), this call can hang forever. This can then potentially prevent the standby from becoming active.
This may actually considered a side effect of HADOOP-6762 – if the RPC were interruptible, that would also fix the issue.
Attachments
Attachments
Issue Links
- is depended upon by
-
HDFS-10734 Rename "dfs.ha.tail-edits.rolledits.timeout" to "dfs.ha.log-roll.execution.timeout"
- Open