[HIVE-8638] Implement bucket map join optimization [Spark Branch] - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.1.0
Component/s: Spark
Labels:
None

Description

In the hive-on-mr implementation, bucket map join optimization has to depend on the map join hint. While in the hive-on-tez implementation, a join can be automatically converted to bucket map join if certain conditions are met such as:
1. the optimization flag hive.convert.join.bucket.mapjoin.tez is ON
2. all join tables are buckets and each small table's bucket number can be divided by big table's bucket number
3. bucket columns == join columns

In the hive-on-spark implementation, it is ideal to have the bucket map join auto-convertion support. when all the required criteria are met, a join can be automatically converted to a bucket map join.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

HIVE-8638.4-spark.patch
07/Dec/14 05:11
262 kB
Jimmy Xiang
HIVE-8638.5-spark.patch
08/Dec/14 22:52
267 kB
Jimmy Xiang

Issue Links

is related to

HIVE-8405 Research Bucket Map Join [Spark Branch]

Resolved

relates to

HIVE-9042 Support multiple mapjoin operators in one work [Spark Branch]

Resolved

Activity

People

Assignee:: Jimmy Xiang

Reporter:: Na Yang

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Dates

Created:: 28/Oct/14 21:46

Updated:: 29/May/15 02:28

Resolved:: 09/Dec/14 02:23