Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
0.23.0
-
None
-
Reviewed
Description
The namenode restart is dominated by the performance of processing block reports. On a 2000 node cluster with 90 million blocks, block report processing takes 30 to 40 minutes. The namenode "diffs" the contents of the incoming block report with the contents of the blocks map, and then applies these diffs to the blocksMap, but in reality there is no need to compute the "diff" because this is the first block report from the datanode.
This code change improves block report processing time by 300%.
Attachments
Attachments
Issue Links
- is duplicated by
-
HDFS-1147 Reduce NN startup time by reducing the processing time of block reports
- Resolved