Details
Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
HA branch (HDFS-1623)
None
None
Reviewed
Description
Here's a potential bug case that we don't currently test for:
- The SBN (standby NameNode) is reading a finalized edits file when NFS disappears halfway through (or some other intermittent error occurs)
- The SBN performs a checkpoint and uploads it to the NN
- The NN receives a checkpoint that doesn't correspond to the end of any log segment
Expected behavior: both the NN and the SBN should still be able to restart at this point.
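
To make the scenario concrete, here is a minimal sketch of what "restartable" would require. It is not the actual HDFS code: the class, the `Segment` type, and both method names are hypothetical, and it assumes finalized segments are sorted and contiguous by transaction ID. On restart, a node loading a checkpoint taken mid-segment has to skip the transactions already covered by the image and replay only the rest.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; none of these names come from HDFS.
final class MidSegmentCheckpointSketch {

    /** A finalized edit log segment covering txids [firstTxId, lastTxId]. */
    static final class Segment {
        final long firstTxId;
        final long lastTxId;

        Segment(long firstTxId, long lastTxId) {
            this.firstTxId = firstTxId;
            this.lastTxId = lastTxId;
        }
    }

    /** Returns true if the checkpoint's last txid is the last txid of some segment. */
    static boolean endsOnSegmentBoundary(List<Segment> segments, long checkpointTxId) {
        for (Segment s : segments) {
            if (s.lastTxId == checkpointTxId) {
                return true;
            }
        }
        return false;
    }

    /**
     * Returns the txids that must be replayed on top of a checkpoint that
     * already includes every transaction up to and including checkpointTxId.
     * Assumes segments are sorted by txid and have no gaps.
     */
    static List<Long> txIdsToReplay(List<Segment> segments, long checkpointTxId) {
        List<Long> toReplay = new ArrayList<>();
        for (Segment s : segments) {
            if (s.lastTxId <= checkpointTxId) {
                continue; // segment is entirely covered by the checkpoint
            }
            // For the segment the checkpoint falls inside, start after the checkpoint.
            long start = Math.max(s.firstTxId, checkpointTxId + 1);
            for (long txId = start; txId <= s.lastTxId; txId++) {
                toReplay.add(txId);
            }
        }
        return toReplay;
    }
}
```

With segments [1,10], [11,20], [21,30] and a checkpoint at txid 15, the checkpoint does not end on a segment boundary, and a restart would need to replay txids 16 through 30. A test for this issue would presumably assert that both nodes come up in exactly that situation.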