Uploaded image for project: 'Apache Avro'
  1. Apache Avro
  2. AVRO-1243

Avro support for all compression codecs

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 1.7.3
    • 1.7.4
    • java
    • None
    • This should have no incompatible changes. The only thing is files saved with the BZip2 codec will not be readable from earlier versions of Avro.

    Description

      I may be reading this wrong but at this time org.apache.avro.file.CodecFactory only supports null, deflate, and snappy compression codecs.

      I would like to change the fromString method to use Class.forName(codec).newInstance(); after the codec was not found in the REGISTERED map but before the AvroRuntimeException is thrown.

      Here are some of my supporting thoughts
      1. This should not interduce much slowness because it will only be called initialize.
      2. This will allow for support for GZip, BZip2, and LZO with out adding more dependances to the maven pom file.
      3. This will allow for a future Jiri I would like to do that would allow AvroOutputFormat to be able to use the following configs: mapred.output.compress and mapred.output.compression.codec

      Attachments

        1. AVRO-1243.patch
          25 kB
          Theodore michael Malaska
        2. AVRO-1243.not-ready.patch
          9 kB
          Theodore michael Malaska
        3. AVRO-1243.not-ready.2.patch
          13 kB
          Theodore michael Malaska
        4. AVRO-1243.not-ready.1.patch
          17 kB
          Theodore michael Malaska
        5. AVRO-1243.1.patch
          29 kB
          Theodore michael Malaska

        Activity

          People

            ted.m Theodore michael Malaska
            ted.m Theodore michael Malaska
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: