Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-5170

Streaming gzip decompression for Parquet and Avro files

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Won't Fix
    • Impala 2.5.0, Impala 2.6.0, Impala 2.7.0, Impala 2.8.0
    • None
    • Backend
    • ghx-label-2

    Description

      To reduce the memory consumption of scans over gzip-compressed data, we should implement streaming decompression (gzip supports it).

      Note that our text scanners already perform streaming decompression by default, but that's not the case for other scanners (e.g. Parquet/Avro).

      Attachments

        Activity

          People

            Unassigned Unassigned
            alex.behm Alexander Behm
            Votes:
            1 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: