[HIVE-11110] Reorder applyPreJoinOrderingTransforms, add NotNULL/FilterMerge rules, improve Filter selectivity estimation - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.0.0
Component/s: CBO
Labels:
None

Description

Query

select  count(*)
 from store_sales
     ,store_returns
     ,date_dim d1
     ,date_dim d2
 where d1.d_quarter_name = '2000Q1'
   and d1.d_date_sk = ss_sold_date_sk
   and ss_customer_sk = sr_customer_sk
   and ss_item_sk = sr_item_sk
   and ss_ticket_number = sr_ticket_number
   and sr_returned_date_sk = d2.d_date_sk
   and d2.d_quarter_name in ('2000Q1','2000Q2','2000Q3’);

The store_sales table is partitioned on ss_sold_date_sk, which is also used in a join clause. The join clause should add a filter “filterExpr: ss_sold_date_sk is not null”, which should get pushed the MetaStore when fetching the stats. Currently this is not done in CBO planning, which results in the stats from _HIVE_DEFAULT_PARTITION_ to be fetched and considered in the optimization phase. In particular, this increases the NDV for the join columns and may result in wrong planning.

Including HiveJoinAddNotNullRule in the optimization phase solves this issue.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HIVE-11110-branch-1.2.patch
07/Jul/15 17:57
9 kB
Ashutosh Chauhan
HIVE-11110-12.patch
15/Sep/15 03:25
179 kB
Laljo John Pullokkaran
HIVE-11110-11.patch
14/Sep/15 17:39
179 kB
Laljo John Pullokkaran
HIVE-11110-10.patch
14/Sep/15 17:38
179 kB
Laljo John Pullokkaran
HIVE-11110.patch
25/Jun/15 14:39
6 kB
jcamachorodriguez
HIVE-11110.92.patch
10/Sep/15 22:01
164 kB
Hari Sankar Sivarama Subramaniyan
HIVE-11110.91.patch
03/Sep/15 20:52
172 kB
Hari Sankar Sivarama Subramaniyan
HIVE-11110.9.patch
01/Sep/15 23:12
132 kB
Hari Sankar Sivarama Subramaniyan
HIVE-11110.8.patch
31/Aug/15 18:26
13 kB
Hari Sankar Sivarama Subramaniyan
HIVE-11110.7.patch
10/Jul/15 23:57
13 kB
Laljo John Pullokkaran
HIVE-11110.6.patch
07/Jul/15 16:34
29 kB
Ashutosh Chauhan
HIVE-11110.5.patch
02/Jul/15 01:50
29 kB
Ashutosh Chauhan
HIVE-11110.4.patch
02/Jul/15 00:28
29 kB
Ashutosh Chauhan
HIVE-11110.35.patch
12/Dec/15 00:27
7.78 MB
Laljo John Pullokkaran
HIVE-11110.34.patch
11/Dec/15 06:43
7.78 MB
Laljo John Pullokkaran
HIVE-11110.33.patch
09/Dec/15 03:52
7.78 MB
Laljo John Pullokkaran
HIVE-11110.32.patch
08/Dec/15 21:59
7.77 MB
Laljo John Pullokkaran
HIVE-11110.31.patch
08/Dec/15 02:26
7.77 MB
Laljo John Pullokkaran
HIVE-11110.30.patch
04/Dec/15 20:56
7.04 MB
Laljo John Pullokkaran
HIVE-11110.29.patch
03/Dec/15 03:29
7.04 MB
Laljo John Pullokkaran
HIVE-11110.28.patch
01/Dec/15 03:40
7.73 MB
Laljo John Pullokkaran
HIVE-11110.27.patch
30/Nov/15 18:23
7.72 MB
Laljo John Pullokkaran
HIVE-11110.27
30/Nov/15 07:29
7.72 MB
Laljo John Pullokkaran
HIVE-11110.26.patch
26/Nov/15 04:26
7.14 MB
Laljo John Pullokkaran
HIVE-11110.25.patch
24/Nov/15 08:50
7.26 MB
Laljo John Pullokkaran
HIVE-11110.24.patch
16/Nov/15 06:09
6.96 MB
Laljo John Pullokkaran
HIVE-11110.23.patch
13/Nov/15 02:38
6.95 MB
Laljo John Pullokkaran
HIVE-11110.22.patch
11/Nov/15 03:17
6.87 MB
Laljo John Pullokkaran
HIVE-11110.21.patch
07/Nov/15 05:03
6.76 MB
Laljo John Pullokkaran
HIVE-11110.20.patch
27/Oct/15 06:12
2.82 MB
Laljo John Pullokkaran
HIVE-11110.2.patch
01/Jul/15 21:06
6 kB
Ashutosh Chauhan
HIVE-11110.19.patch
16/Oct/15 01:52
169 kB
Laljo John Pullokkaran
HIVE-11110.18.patch
15/Oct/15 01:40
163 kB
Laljo John Pullokkaran
HIVE-11110.17.patch
24/Sep/15 23:23
168 kB
Laljo John Pullokkaran
HIVE-11110.16.patch
24/Sep/15 02:40
167 kB
Laljo John Pullokkaran
HIVE-11110.15.patch
21/Sep/15 22:04
161 kB
Laljo John Pullokkaran
HIVE-11110.14.patch
17/Sep/15 04:01
179 kB
Laljo John Pullokkaran
HIVE-11110.13.patch
16/Sep/15 06:24
0.9 kB
Laljo John Pullokkaran
HIVE-11110.1.patch
27/Jun/15 07:20
32 kB
Laljo John Pullokkaran

Issue Links

blocks

HIVE-11865 Disable Hive PPD optimizer when CBO has optimized the plan

Closed

is related to

HIVE-12478 Improve Hive/Calcite Transitive Predicate inference

Closed

relates to

HIVE-11764 Verify the correctness of groupby_cube1.q with MR, Tez and Spark Mode with HIVE-11110

Open

HIVE-11918 Implement/Enable constant related optimization rules in Calcite

Resolved

requires

HIVE-11151 Calcite transitive predicate inference rule should not transitively add not null filter on non-nullable input

Closed

HIVE-11152 Swapping join inputs in ASTConverter

Closed

links to

(1 requires, 1 links to)

Activity

People

Assignee:: Laljo John Pullokkaran

Reporter:: Jesús Camacho Rodríguez

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 25/Jun/15 14:36

Updated:: 27/Feb/24 22:24

Resolved:: 12/Dec/15 06:55