[HDFS-7980] Incremental BlockReport will dramatically slow down the startup of a namenode - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.6.1, 2.8.0, 2.7.1, 3.0.0-alpha1
Component/s: None
Labels:
- 2.6.1-candidate

Hadoop Flags:

Reviewed

Description

In the current implementation the datanode will call the reportReceivedDeletedBlocks() method that is a IncrementalBlockReport before calling the bpNamenode.blockReport() method. So in a large(several thousands of datanodes) and busy cluster it will slow down(more than one hour) the startup of namenode.

List<DatanodeCommand> blockReport() throws IOException {
    // send block report if timer has expired.
    final long startTime = now();
    if (startTime - lastBlockReport <= dnConf.blockReportInterval) {
      return null;
    }

    final ArrayList<DatanodeCommand> cmds = new ArrayList<DatanodeCommand>();

    // Flush any block information that precedes the block report. Otherwise
    // we have a chance that we will miss the delHint information
    // or we will report an RBW replica after the BlockReport already reports
    // a FINALIZED one.
    reportReceivedDeletedBlocks();
    lastDeletedReport = startTime;
    .........
        // Send the reports to the NN.
    int numReportsSent = 0;
    int numRPCs = 0;
    boolean success = false;
    long brSendStartTime = now();
    try {
      if (totalBlockCount < dnConf.blockReportSplitThreshold) {
        // Below split threshold, send all reports in a single message.
        DatanodeCommand cmd = bpNamenode.blockReport(
            bpRegistration, bpos.getBlockPoolId(), reports);

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HDFS-7980.001.patch
26/Mar/15 12:53
0.9 kB
Walter Su
HDFS-7980.002.patch
08/Apr/15 09:40
7 kB
Walter Su
HDFS-7980.003.patch
28/Apr/15 03:38
8 kB
Walter Su
HDFS-7980.004.patch
29/Apr/15 02:41
9 kB
Walter Su
HDFS-7980.004.repost.patch
06/May/15 12:36
9 kB
Walter Su
HDFS-7980-branch-2.6.1.txt
07/Sep/15 18:50
9 kB
Vinod Kumar Vavilapalli

Issue Links

relates to

HDFS-8380 Always call addStoredBlock on blocks which have been shifted from one storage to another

Resolved

Activity

People

Assignee:: Walter Su

Reporter:: Hui Zheng

Votes:: 0 Vote for this issue

Watchers:: 18 Start watching this issue

Dates

Created:: 24/Mar/15 10:36

Updated:: 06/Jan/17 01:38

Resolved:: 07/May/15 18:40