Details
- Type: Bug
- Status: Closed
- Priority: Major
- Resolution: Fixed
- 4.0.0
- None
Description
explain cbo
with ss as (
  select count(1), ss_item_sk, ss_ticket_number
  from store_sales
  group by ss_item_sk, ss_ticket_number
  having count(1) > 1
)
select count(1) from item where i_item_sk IN (select ss_item_sk from ss);
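Notice the HiveProject(ss_item_sk=[$0], ss_ticket_number=[$1], $f2=[$2]) on the right-hand input of the HiveSemiJoin in the plan below.
Only ss_item_sk is relevant for the HiveSemiJoin, so ss_ticket_number and $f2 are projected unnecessarily.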
CBO PLAN:
HiveAggregate(group=[{}], agg#0=[count()])
  HiveSemiJoin(condition=[=($0, $1)], joinType=[inner])
    HiveProject(i_item_sk=[$0])
      HiveFilter(condition=[IS NOT NULL($0)])
        HiveTableScan(table=[[tpcds_copy_orc_partitioned_10000, item]], table:alias=[item])
    HiveProject(ss_item_sk=[$0], ss_ticket_number=[$1], $f2=[$2])
      HiveFilter(condition=[>($2, 1)])
        HiveAggregate(group=[{1, 8}], agg#0=[count()])
          HiveFilter(condition=[IS NOT NULL($1)])
            HiveTableScan(table=[[tpcds_copy_orc_partitioned_10000, store_sales]], table:alias=[store_sales])
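The point that only ss_item_sk matters to the semi-join can be checked on toy data. The sketch below is not Hive: it uses Python's sqlite3 module as a stand-in engine, and the table contents are made up for illustration. It runs the query from the description (without "explain cbo") next to a hand-pruned variant whose CTE exposes only ss_item_sk, and compares the results. It says nothing about which plan Hive produces for either form.

```python
import sqlite3

# Toy stand-in for the TPC-DS tables; the rows are invented for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE item (i_item_sk INTEGER);
    CREATE TABLE store_sales (ss_item_sk INTEGER, ss_ticket_number INTEGER);
    INSERT INTO item VALUES (1), (2), (3);
    INSERT INTO store_sales VALUES
        (1, 10), (1, 10), (2, 20), (3, 30), (3, 30), (3, 31);
""")

# Query from the description, minus the "explain cbo" prefix.
original = """
    WITH ss AS (
        SELECT count(1), ss_item_sk, ss_ticket_number
        FROM store_sales
        GROUP BY ss_item_sk, ss_ticket_number
        HAVING count(1) > 1
    )
    SELECT count(1) FROM item WHERE i_item_sk IN (SELECT ss_item_sk FROM ss)
"""

# Hand-pruned variant: the CTE exposes only the column the IN-subquery reads.
# The grouping keys and the HAVING filter are unchanged.
pruned = """
    WITH ss AS (
        SELECT ss_item_sk
        FROM store_sales
        GROUP BY ss_item_sk, ss_ticket_number
        HAVING count(1) > 1
    )
    SELECT count(1) FROM item WHERE i_item_sk IN (SELECT ss_item_sk FROM ss)
"""

original_count = conn.execute(original).fetchone()[0]
pruned_count = conn.execute(pruned).fetchone()[0]
print(original_count, pruned_count)
```

Both queries return the same count on this data, which is consistent with ss_ticket_number and $f2 being dead columns above the HAVING filter.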
Attachments
Issue Links
- relates to
  - HIVE-21381 Improve column pruning (Open)
  - HIVE-21382 Group by keys reduction optimization - keys are not reduced in query23 (Closed)