[HIVE-14199] Enable Bucket Pruning for ACID tables - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.3.0
Component/s: Transactions
Labels:
None

Target Version/s:

2.2.0
Hadoop Flags:

Reviewed

Description

Currently, ACID tables do not benefit from the bucket pruning feature introduced in ~~HIVE-11525~~. The reason for this has been the fact that bucket pruning happens at split generation level and for ACID, traditionally the delta files were never split. The parallelism for ACID was then restricted to the number of buckets. There would be as many splits as the number of buckets and each worker processing one split would inevitably read all the delta files for that bucket, even when the query may have originally required only one of the buckets to be read.
However, ~~HIVE-14035~~ now enables even the delta files to be also split. What this means is that now we have enough information at the split generation level to determine appropriate buckets to process for the delta files. This can efficiently allow us to prune unnecessary buckets for delta files and will lead to good performance gain for a large number of selective queries on ACID tables.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HIVE-14199.01.patch
09/Jul/16 00:06
11 kB
Saket Saurabh
HIVE-14199.02.patch
09/Jul/16 02:04
10 kB
Saket Saurabh
HIVE-14199.03.patch
12/Aug/16 22:27
10 kB
Saket Saurabh

Issue Links

is related to

HIVE-11525 Bucket pruning

Closed

requires

HIVE-14035 Enable predicate pushdown to delta files created by ACID Transactions

Resolved

Activity

People

Assignee:: Saket Saurabh

Reporter:: Saket Saurabh

Votes:: 0 Vote for this issue

Watchers:: 7 Start watching this issue

Dates

Created:: 08/Jul/16 23:50

Updated:: 11/Oct/17 17:20

Resolved:: 22/Aug/16 22:35