Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
ghx-label-13
Description
Puffin File format is an iceberg upstream spec to support stats for iceberg tables. These stats cannot be both read and used for query planning and written by Impala today. We want to extend support in Impala to do the below -
- Read stats from Puffin files
- Write stats to puffin files during load/insert/update/delete commands (as applicable)
- Modify compute stats command for iceberg tables to compute stats and store them in Puffin files