Details
Sub-task
Status: Resolved
Major
Resolution: Fixed
Edit log branch (HDFS-1073)
None
Reviewed
Description
HDFS-1984 added the ability to run two secondary namenodes (2NNs) at the same time. However, I found two races when stress testing this by running two 2NNs, each checkpointing in a tight loop with no sleep:
1) The write of the seen_txid file was not atomic, so a reader could at times see an empty file.
2) Two checkpointers could try to take a checkpoint at the same transaction ID, which caused the two image downloads to collide and fail.
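
To make the two races concrete, here is a minimal sketch of one common way to avoid each of them. It is not the patch attached to this issue. The class name CheckpointRaceSketch and all of its methods are hypothetical and are not part of the HDFS code base. For race 1 it writes to a temporary file in the same directory and renames it over the target, so a reader sees either the old contents or the new ones, never an empty file. For race 2 it tracks the transaction IDs that currently have a checkpoint in flight and rejects a second attempt at the same ID.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical illustration only; names do not come from HDFS.
public final class CheckpointRaceSketch {

    // Transaction IDs that currently have a checkpoint upload in progress.
    private static final Set<Long> IN_PROGRESS = ConcurrentHashMap.newKeySet();

    private CheckpointRaceSketch() {}

    // Race 1: write the txid to a temp file, then rename it into place.
    // Assumption: the temp file and the target are on the same filesystem,
    // which is required for an atomic rename. This sketch does not fsync.
    public static void writeTxIdAtomically(Path target, long txid) throws IOException {
        Path dir = target.toAbsolutePath().getParent();
        Path tmp = Files.createTempFile(dir, "seen_txid", ".tmp");
        try {
            Files.write(tmp, (txid + "\n").getBytes(StandardCharsets.UTF_8));
            // With ATOMIC_MOVE, whether an existing target is replaced is
            // implementation specific; on POSIX systems rename replaces it.
            Files.move(tmp, target, StandardCopyOption.ATOMIC_MOVE);
        } finally {
            Files.deleteIfExists(tmp); // no-op when the move succeeded
        }
    }

    public static long readTxId(Path file) throws IOException {
        String text = new String(Files.readAllBytes(file), StandardCharsets.UTF_8);
        return Long.parseLong(text.trim());
    }

    // Race 2: returns true only for the first caller that claims this txid.
    public static boolean tryBeginCheckpoint(long txid) {
        return IN_PROGRESS.add(txid);
    }

    // Releases the claim so a later checkpoint at this txid can proceed.
    public static void endCheckpoint(long txid) {
        IN_PROGRESS.remove(txid);
    }
}
```

A checkpointer would call tryBeginCheckpoint before downloading an image, skip the checkpoint if it returns false, and call endCheckpoint in a finally block once the download finishes or fails.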