  1. HBase
  2. HBASE-2225

Enable compression in HBase Export

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Duplicate
    • Affects Version/s: 0.20.1
    • Fix Version/s: 0.90.0
    • Component/s: util
    • Labels: None
    • Environment: OS agnostic
    • Release Note: HBase Export tool now uses GzipCodec to compress its output.

    Description

      org.apache.hadoop.hbase.mapreduce.Export should set a compression codec for its output.

      In createSubmittableJob(), the following should be added:
      FileOutputFormat.setCompressOutput(job, true);
      FileOutputFormat.setOutputCompressorClass(job, org.apache.hadoop.io.compress.GzipCodec.class);

      In my experiments, this reduced the size of the Export output by 10% to 50%.

      SequenceFileInputFormat, which the Import tool uses, detects GzipCodec on its own, so the Import class needs no change.
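
      How much gzip saves depends on how repetitive the exported data is. As a rough illustration only, the sketch below uses the JDK's java.util.zip classes as a stand-in for Hadoop's GzipCodec; it does not write a SequenceFile and does not touch HBase. The class name GzipSizeSketch, the helper names, and the sample rows are all made up for this example, so the sizes it prints say nothing about a real Export run.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical stand-alone sketch: JDK gzip only, no Hadoop or HBase classes.
public class GzipSizeSketch {

    // Compress a byte array with gzip and return the compressed bytes.
    public static byte[] gzip(byte[] raw) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (GZIPOutputStream out = new GZIPOutputStream(buffer)) {
            out.write(raw);
        } // closing the stream writes the gzip trailer into the buffer
        return buffer.toByteArray();
    }

    // Decompress gzip bytes back into the original payload.
    public static byte[] gunzip(byte[] compressed) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (GZIPInputStream in =
                 new GZIPInputStream(new ByteArrayInputStream(compressed))) {
            byte[] chunk = new byte[4096];
            int read;
            while ((read = in.read(chunk)) != -1) {
                buffer.write(chunk, 0, read);
            }
        }
        return buffer.toByteArray();
    }

    // Build made-up, repetitive text rows (an assumption, not real Export output).
    public static byte[] sampleRows(int count) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < count; i++) {
            sb.append("row-").append(i)
              .append("\tcf:qualifier\tvalue-").append(i % 10)
              .append('\n');
        }
        return sb.toString().getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        byte[] raw = sampleRows(10_000);
        byte[] compressed = gzip(raw);
        System.out.printf("raw=%d bytes, gzip=%d bytes%n",
                raw.length, compressed.length);
        System.out.println("round trip ok: "
                + Arrays.equals(raw, gunzip(compressed)));
    }
}
```

      Running main prints the raw and compressed sizes for the sample payload and confirms that decompression restores the original bytes. Real Export output holds serialized row data rather than text lines, so the ratio observed there will differ.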

      Attachments

        1. HBASE-2225-trunk.patch
          3 kB
          Lars George
        2. HBASE-2225-v2-trunk.patch
          9 kB
          Lars George

            People

              Assignee: Kannan Muthukkaruppan
              Reporter: Ted Yu
              Votes: 0
              Watchers: 2

                Time Tracking

                  Estimated: 0.5h
                  Remaining: 0.5h
                  Logged: Not Specified