The current Cluster status :
137407265 files and directories, 129614074 blocks = 267021339 total filesystem object(s).
The checkpoint save namespace cost more than 5 min.
DataNode rpc timeout.
Standby NameNode skip the DataNode rpc request(because datanode rpc timeout , datanode close the socket channel).
There are many corrupt files when failover.
So, Checkpoint may be done by other component, not Standby NameNode.
- is duplicated by
HDFS-7097 Allow block reports to be processed during checkpointing on standby name node