Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Duplicate
-
2.7.0
-
None
-
None
-
None
Description
Number of DN = 2
Step 1: Write a file with replication factor - 3 .
Step 2: Corrupt a replica in DN1
Step 3: DN2 is down.
Missing Block count in report is as follows
Fsck report : 0
Jmx, "dfsadmin -report" , UI, logs : 1
In fsck , only block whose replicas are all missed and not been corrupted are counted
if (totalReplicasPerBlock == 0 && !isCorrupt) { // If the block is corrupted, it means all its available replicas are // corrupted. We don't mark it as missing given these available replicas // might still be accessible as the block might be incorrectly marked as // corrupted by client machines.
While in other reports even if all the replicas are corrupted , block is been considered as missed.
Please provide your thoughts : can we make missing block count consistent across all the reports same as implemented for fsck?
Attachments
Issue Links
- is duplicated by
-
HDFS-10213 there is a corrupt/missing block report, but fsck result is HEALTHY
- Resolved
- is related to
-
HDFS-13999 Bogus missing block warning if the file is under construction when NN starts
- Resolved