Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
Impala 2.10.0
-
ghx-label-6
Description
Column chunks that only contain NULL values don't have their min_value and max_value fields populated and as a result we cannot skip row groups based on predicates on such columns. IMPALA-5061 added support to populate the null_count in statistics, allowing us to detect column chunks that only contain NULLs. We should use that information to skip row groups if the predicate allows us to.
Attachments
Issue Links
- is a child of
-
IMPALA-4989 Improve filtering based on parquet::Statistics
- Resolved