[HIVE-3784] de-emphasize mapjoin hint - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.11.0
Component/s: Query Processor
Labels:
None

Release Note:
Map join hint will no longer be valid for some queries. Drop the hint in those cases. Hive will automatically try to convert join to map-join with config hive.auto.convert.join set to true.

Description

hive.auto.convert.join has been around for a long time, and is pretty stable.
When mapjoin hint was created, the above parameter did not exist.

The only reason for the user to specify a mapjoin currently is if they want
it to be converted to a bucketed-mapjoin or a sort-merge bucketed mapjoin.
Eventually, that should also go away, but that may take some time to stabilize.

There are many rules in SemanticAnalyzer to handle the following trees:

ReduceSink -> MapJoin
Union -> MapJoin
MapJoin -> MapJoin

This should not be supported anymore. In any of the above scenarios, the
user can get the mapjoin behavior by setting hive.auto.convert.join to true
and not specifying the hint. This will simplify the code a lot.

What does everyone think ?

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

hive.3784.1.patch
12/Dec/12 11:43
209 kB
Namit Jain
hive.3784.10.patch
24/Jan/13 11:24
504 kB
Namit Jain
hive.3784.11.patch
24/Jan/13 17:02
547 kB
Namit Jain
hive.3784.12.patch
25/Jan/13 04:40
550 kB
Namit Jain
hive.3784.13.patch
25/Jan/13 05:18
617 kB
Namit Jain
hive.3784.14.patch
25/Jan/13 05:34
617 kB
Namit Jain
hive.3784.15.patch
25/Jan/13 06:50
657 kB
Namit Jain
hive.3784.16.patch
25/Jan/13 13:28
658 kB
Namit Jain
hive.3784.17.patch
27/Jan/13 18:28
659 kB
Namit Jain
hive.3784.18.patch
27/Jan/13 18:28
659 kB
Namit Jain
hive.3784.19.patch
27/Jan/13 18:32
659 kB
Namit Jain
hive.3784.2.patch
12/Dec/12 12:08
209 kB
Namit Jain
hive.3784.21.patch
28/Jan/13 17:49
659 kB
Namit Jain
hive.3784.22.patch
29/Jan/13 03:52
665 kB
Namit Jain
hive.3784.3.patch
13/Dec/12 06:26
434 kB
Namit Jain
hive.3784.4.patch
13/Dec/12 08:51
650 kB
Namit Jain
hive.3784.5.patch
14/Dec/12 06:21
650 kB
Namit Jain
hive.3784.6.patch
22/Jan/13 05:45
528 kB
Namit Jain
hive.3784.7.patch
23/Jan/13 17:04
571 kB
Namit Jain
hive.3784.8.patch
24/Jan/13 06:29
578 kB
Namit Jain
hive.3784.9.patch
24/Jan/13 09:23
581 kB
Namit Jain

Issue Links

duplicates

HIVE-3652 Join optimization for star schema

Resolved

HIVE-1695 MapJoin followed by ReduceSink should be done as single MapReduce Job

Closed

is depended upon by

HIVE-3952 merge map-job followed by map-reduce job

Closed

HIVE-4042 ignore mapjoin hint

Closed

is duplicated by

HIVE-3652 Join optimization for star schema

Resolved

is related to

HIVE-3403 user should not specify mapjoin to perform sort-merge bucketed join

Closed

HIVE-3633 sort-merge join does not work with sub-queries

Closed

relates to

HIVE-3326 plan for multiple mapjoin followed by a normal join is wrong

Resolved

HIVE-3403 user should not specify mapjoin to perform sort-merge bucketed join

Closed

(2 is related to, 2 relates to)

Activity

People

Assignee:: Namit Jain

Reporter:: Namit Jain

Votes:: 1 Vote for this issue

Watchers:: 14 Start watching this issue

Dates

Created:: 10/Dec/12 06:20

Updated:: 13/May/15 07:45

Resolved:: 29/Jan/13 15:38