Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-13700

The process of loading image can be done in a pipeline model

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Patch Available
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      The process of loading a file system image involves reading inodes section, deserializing inodes, initializing inodes, adding inodes to the global map, reading directories section, adding inodes to their parents' map, cache name etc. These steps can be done in a pipeline model to reduce the total duration. 

      Test the patch against a fsimage of a 70PB  cluster (200million files and 300million blocks, the fsimage is around 22GB), the image loading time be reduced from 1210 seconds to 739 seconds.

      Attachments

        1. HDFS-13700-001.patch
          24 kB
          zhouyingchao
        2. HDFS-13700.002.patch
          33 kB
          Lisheng Sun

        Activity

          People

            leosun08 Lisheng Sun
            sinago zhouyingchao
            Votes:
            0 Vote for this issue
            Watchers:
            11 Start watching this issue

            Dates

              Created:
              Updated: