Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
1.2.1
-
None
-
None
-
Reviewed
Description
Our cluster have 40G fsimage, we write one copy of edit log to NFS.
After NFS temporary failed, when doing checkpoint, NameNode try to recover it, and it will save 40G fsimage to NFS, it takes some time (> 40G/128MB/s = 320 seconds) , and it locked FSNamesystem, and this bring down our cluster.
Attachments
Attachments
Issue Links
- relates to
-
HDFS-5396 FSImage.getFsImageName should check whether fsimage exists
- Resolved