Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-1591

Enable bzip compression in benchmark

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 2.9, 3.1, 4.0-ALPHA
    • modules/benchmark
    • None
    • New, Patch Available

    Description

      bzip compression can aid the benchmark package by not requiring extracting bzip files (such as enwiki) in order to index them. The plan is to add a config parameter bzip.compression=true/false and in the relevant tasks either decompress the input file or compress the output file using the bzip streams.
      It will add a dependency on ant.jar which contains two classes similar to GZIPOutputStream and GZIPInputStream which compress/decompress files using the bzip algorithm.

      bzip is known to be superior in its compression performance to the gzip algorithm (~20% better compression), although it does the compression/decompression a bit slower.

      I wil post a patch which adds this parameter and implement it in LineDocMaker, EnwikiDocMaker and WriteLineDoc task. Maybe even add the capability to DocMaker or some of the super classes, so it can be inherited by all sub-classes.

      Attachments

        1. LUCENE-1591.patch
          15 kB
          Shai Erera
        2. LUCENE-1591.patch
          20 kB
          Shai Erera
        3. LUCENE-1591.patch
          21 kB
          Shai Erera
        4. LUCENE-1591.patch
          35 kB
          Shai Erera
        5. LUCENE-1591.patch
          45 kB
          Shai Erera
        6. LUCENE-1591.patch
          47 kB
          Shai Erera
        7. LUCENE-1591.patch
          47 kB
          Shai Erera
        8. LUCENE-1591.patch
          2 kB
          Mark Miller
        9. commons-compress-dev20090413.jar
          137 kB
          Uwe Schindler
        10. commons-compress-dev20090413.jar
          137 kB
          Uwe Schindler

        Activity

          People

            markrmiller@gmail.com Mark Miller
            shaie Shai Erera
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: