[HIVE-17626] Query reoptimization using cached runtime statistics - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 3.0.0
Fix Version/s: 3.0.0
Component/s: Logical Optimizer
Labels:
- TODOC3.0

Target Version/s:

3.0.0

Description

Something similar to "EXPLAIN ANALYZE" where we annotate explain plan with actual and estimated statistics. The runtime stats can be cached at query level and subsequent execution of the same query can make use of the cached statistics from the previous run for better optimization.
Some use cases,
1) re-planning join query (mapjoin failures can be converted to shuffle joins)
2) better statistics for table scan operator if dynamic partition pruning is involved
3) Better estimates for bloom filter initialization (setting expected entries during merge)

This can extended to support wider queries by caching fragments of operator plans scanning same table(s) or matching some operator sequences.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HIVE-17626.01.patch
26/Feb/18 12:21
181 kB
Zoltan Haindrich
HIVE-17626.01wip01.patch
20/Feb/18 11:05
143 kB
Zoltan Haindrich
HIVE-17626.02.patch
26/Feb/18 18:18
268 kB
Zoltan Haindrich
HIVE-17626.03.patch
27/Feb/18 11:16
232 kB
Zoltan Haindrich
HIVE-17626.04.patch
27/Feb/18 16:12
248 kB
Zoltan Haindrich
HIVE-17626.05.patch
27/Feb/18 18:35
248 kB
Zoltan Haindrich
HIVE-17626.06.patch
01/Mar/18 15:31
301 kB
Zoltan Haindrich
HIVE-17626.07A.patch
04/Mar/18 08:31
302 kB
Zoltan Haindrich
HIVE-17626.07B.patch
04/Mar/18 08:19
299 kB
Zoltan Haindrich
HIVE-17626.08.patch
05/Mar/18 08:01
303 kB
Zoltan Haindrich
HIVE-17626.09.patch
05/Mar/18 11:49
304 kB
Zoltan Haindrich
HIVE-17626.10.patch
05/Mar/18 22:09
304 kB
Zoltan Haindrich
HIVE-17626.11.patch
06/Mar/18 11:02
302 kB
Zoltan Haindrich
runtimestats.patch
27/Sep/17 23:25
46 kB
Prasanth Jayachandran

Issue Links

links to

Sub-Tasks

1.	Cleanup unused methods in Driver	Closed	Zoltan Haindrich
2.	remove PostExecute / PreExecute hook support	Closed	Zoltan Haindrich
3.	Remove CommandNeedRetryException	Closed	Zoltan Haindrich
4.	Make CommandProcessorResponse an exception instead of a return class	Closed	Zoltan Haindrich
5.	Introduce interface above driver	Closed	Zoltan Haindrich
6.	Driver execution may not have configuration changing sideeffects	Closed	Zoltan Haindrich
7.	Cartesian error for joins defined in where clause	Resolved	Unassigned
8.	Generalize hook dispatch logics in Driver	Closed	Zoltan Haindrich
9.	Add ConstantPropagate before stats annotation	Closed	Zoltan Haindrich
10.	Make Operator comparision to be based on some primitive	Closed	Zoltan Haindrich
11.	Imporve operator-tree matching	Closed	Zoltan Haindrich
12.	Retain and use runtime statistics during hs2 lifetime	Closed	Zoltan Haindrich
13.	There are 2 configs to detect/warn for cross products	Open	Unassigned
14.	Persist runtime statistics in metastore	Closed	Zoltan Haindrich
15.	Add an OpTreeSignature persistence checker hook	Closed	Zoltan Haindrich
16.	RuntimeStats fixes	Closed	Zoltan Haindrich
17.	Handle explain analyze for reoptimization	Open	Zoltan Haindrich
18.	Fix Signature matching of table aliases	Closed	Zoltan Haindrich
19.	Fix runtime stats for merge statement	Closed	Zoltan Haindrich

Activity

People

Assignee:: Zoltan Haindrich

Reporter:: Prasanth Jayachandran

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Dates

Created:: 27/Sep/17 23:25

Updated:: 22/May/18 23:14

Resolved:: 07/Mar/18 08:19