Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-1435

Provide an option to store fsimage compressed

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.22.0
    • 0.22.0
    • namenode
    • None
    • Incompatible change, Reviewed
    • This provides an option to store fsimage compressed. The layout version is bumped to -25. The user could configure if s/he wants the fsimage to be compressed or not and which codec to use. By default the fsimage is not compressed.

    Description

      Our HDFS has fsimage as big as 20G bytes. It consumes a lot of network bandwidth when secondary NN uploads a new fsimage to primary NN.

      If we could store fsimage compressed, the problem could be greatly alleviated.

      I plan to provide a new configuration hdfs.image.compressed with a default value of false. If it is set to be true, fsimage is stored as compressed.

      The fsimage will have a new layout with a new field "compressed" in its header, indicating if the namespace is stored compressed or not.

      Attachments

        1. trunkImageCompress4.patch
          17 kB
          Hairong Kuang
        2. trunkImageCompress3.patch
          17 kB
          Hairong Kuang
        3. trunkImageCompress2.patch
          17 kB
          Hairong Kuang
        4. trunkImageCompress1.patch
          16 kB
          Hairong Kuang
        5. trunkImageCompress.patch
          14 kB
          Hairong Kuang
        6. checkpoint-limitandcompress.patch
          13 kB
          Yilei Lu

        Issue Links

          Activity

            People

              hairong Hairong Kuang
              hairong Hairong Kuang
              Votes:
              0 Vote for this issue
              Watchers:
              13 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: