Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
None
-
None
-
ghx-label-11
Description
Parquet stores min/max stats for pages which can be used to skip certain pages if they don't qualify an equi-join predicate.
The query below ends up scanning all rows for table a, which may not be needed if the min/max of b.ss_addr_sk can be detected and used during the scan of a.
select a.ss_sold_time_sk from store_sales a join [SHUFFLE] store_sales b where a.ss_addr_sk = b.ss_addr_sk and b.ss_customer_sk < 10 ;
Attachments
Issue Links
- relates to
-
IMPALA-3430 Runtime filter : Extend runtime filter to support Min/Max values for HDFS scans
- Resolved