Details

    • Type: Sub-task
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: namenode
    • Labels: None

      Description

      During namenode startup, load is very high because the namenode has to process every datanode's block report one by one. If hundreds of datanode block reports are pending, the problem becomes more serious, even though #processFirstBlockReport is much more efficient than ordinary block report processing. Some datanodes then retry their block reports, which lengthens the restart time. I think we should filter out block report requests (resent via datanode block report retries) that have already been processed and return directly, which would shorten the restart time. Note that the benefit of this proposal may only be noticeable on large clusters.
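
The filtering idea could look roughly like the sketch below. This is only an illustration under stated assumptions, not existing namenode code: the class `BlockReportRetryFilter`, its methods, and the idea of keying on a datanode UUID plus a per-report id are all hypothetical, and it assumes a retried report carries the same id as the original attempt.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

/**
 * Hypothetical sketch of a retry filter for block reports.
 * Assumption: each report carries an id that stays the same across retries.
 */
class BlockReportRetryFilter {
    // Id of the last fully processed report, keyed by datanode UUID (assumed key).
    private final ConcurrentMap<String, Long> lastProcessed = new ConcurrentHashMap<>();

    /** Returns true if this exact report from this datanode was already processed. */
    boolean isDuplicate(String datanodeUuid, long reportId) {
        Long last = lastProcessed.get(datanodeUuid);
        return last != null && last == reportId;
    }

    /** Records that the report was processed, so a later retry can be skipped. */
    void markProcessed(String datanodeUuid, long reportId) {
        lastProcessed.put(datanodeUuid, reportId);
    }
}
```

In this sketch the block report handler would call `isDuplicate` first and return immediately when it yields true, and call `markProcessed` only after the report has been fully applied. The check and the mark are not atomic here, so a real implementation would need to run them under whatever lock already guards report processing.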

            People

            • Assignee:
              hexiaoqiao He Xiaoqiao
              Reporter:
              hexiaoqiao He Xiaoqiao
            • Votes:
              1
              Watchers:
              7

              Dates

              • Created:
                Updated: