Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
ghx-label-5
Description
PARQUET-970 added these codecs to the format. We have LZ4 in the toolchain already and I just added zstd: https://gerrit.cloudera.org/#/c/13079/
These codec probably offer a better trade-off of density and speed than snappy or gzip.
https://github.com/apache/arrow/pull/807/files might be a useful crib sheet for how to add a compressor.
LZ4 support will be added using: https://issues.apache.org/jira/browse/IMPALA-8617