Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
Impala 2.8.0
Description
Apache Parquet have some nice performance improvements to bit-packed decoding in their Parquet scanner (which is derived from Impala's): https://github.com/apache/parquet-cpp/pull/140
We should do something similar - i.e. switch to more of a batch-oriented approach to decoding rather than value-at-a-time
Attachments
Attachments
Issue Links
- is a parent of
-
IMPALA-4177 Add batch dictionary/RLE decoding in Parquet
- Resolved
- is related to
-
IMPALA-4864 Speed up binary predicates against dictionary encoded Parquet data by converting the predicates to their codewords
- Open