Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-8865

Improve quota initialization performance

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 2.8.0, 3.0.0-alpha1, 2.6.6, 2.7.5
    • None
    • None
    • Reviewed

    Description

      After replaying edits, the whole file system tree is recursively scanned in order to initialize the quota. For big name space, this can take a very long time. Since this is done during namenode failover, it also affects failover latency.

      By using the Fork-Join framework, I was able to greatly reduce the initialization time. The following is the test result using the fsimage from one of the big name nodes we have.

      threads seconds
      1 (existing) 55
      1 (fork-join) 68
      4 16
      8 8
      12 6
      16 5
      20 4

      Attachments

        1. HDFS-8865_branch-2.6.patch
          11 kB
          Nicolas Fraison
        2. HDFS-8865_branch-2.7.patch
          18 kB
          Kihwal Lee
        3. HDFS-8865.branch-2.6.01.patch
          15 kB
          Xiao Chen
        4. HDFS-8865.branch-2.6.patch
          21 kB
          Xiao Chen
        5. HDFS-8865.branch-2.7.patch
          19 kB
          Xiao Chen
        6. HDFS-8865.patch
          18 kB
          Kihwal Lee
        7. HDFS-8865.v2.checkstyle.patch
          18 kB
          Kihwal Lee
        8. HDFS-8865.v2.patch
          18 kB
          Kihwal Lee
        9. HDFS-8865.v3.patch
          18 kB
          Kihwal Lee

        Issue Links

          Activity

            People

              kihwal Kihwal Lee
              kihwal Kihwal Lee
              Votes:
              0 Vote for this issue
              Watchers:
              17 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: