[HIVE-3652] Join optimization for star schema - ASF JIRA

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Duplicate
Affects Version/s: None
Fix Version/s: None
Component/s: Query Processor
Labels: None

Description

Currently, joining one fact table with multiple dimension tables produces a separate MapReduce job for each dimension-table join, because each join is on a different key.
Usually the dimension tables are small enough to fit into memory, so a map-side join can be used to join them with the fact table.

In this issue I want to look at optimizing such queries to generate a single MapReduce job, so that the mapper loads the dimension tables into memory and joins them with the fact table, even though each join uses a different key.
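
The idea can be illustrated with a small in-memory sketch. This is not Hive's implementation; it only shows the shape of a single-pass map-side join in which each dimension table is indexed on its own key. The function names, column names, and sample rows are all hypothetical, and the sketch assumes an inner join with unique dimension keys.

```python
from typing import Dict, Iterable, Iterator, List, Tuple

Row = Dict[str, object]


def build_hash_table(dim_rows: Iterable[Row], key: str) -> Dict[object, Row]:
    """Index a small dimension table by its join key (assumes unique keys)."""
    return {row[key]: row for row in dim_rows}


def star_map_join(
    fact_rows: Iterable[Row],
    dims: List[Tuple[str, Dict[object, Row]]],
) -> Iterator[Row]:
    """Inner-join each fact row against every dimension in one pass.

    `dims` pairs a fact-table foreign-key column with the in-memory hash
    table of the dimension that column references.
    """
    for fact in fact_rows:
        joined = dict(fact)
        for fk_column, table in dims:
            match = table.get(fact[fk_column])
            if match is None:
                break  # inner join: drop fact rows with no matching dimension row
            joined.update(match)
        else:
            # Reached only when every dimension lookup succeeded.
            yield joined


# Hypothetical sample data: two dimensions joined on different keys.
date_dim = build_hash_table([{"date_id": 1, "year": 2012}], "date_id")
store_dim = build_hash_table([{"store_id": 7, "city": "Pune"}], "store_id")
sales = [
    {"date_fk": 1, "store_fk": 7, "amount": 10},
    {"date_fk": 1, "store_fk": 8, "amount": 5},  # no matching store row
]
result = list(
    star_map_join(sales, [("date_fk", date_dim), ("store_fk", store_dim)])
)
```

The fact table is scanned once, and each row probes every dimension hash table in turn, so no shuffle on any individual join key is needed as long as all dimension tables fit in memory.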

Attachments

HIVE-3652-tests.patch, 13/Feb/13 08:44, 17 kB, Amareshwari Sriramadasu
HIVE-3652-tests.patch, 13/Feb/13 08:11, 23 kB, Amareshwari Sriramadasu

Issue Links

duplicates

HIVE-3784 de-emphasize mapjoin hint

Closed

is duplicated by

HIVE-3784 de-emphasize mapjoin hint

Closed

People

Assignee: Vikram Dixit K

Reporter: Amareshwari Sriramadasu

Votes: 0

Watchers: 14

Dates

Created: 02/Nov/12 06:53

Updated: 06/May/13 18:17

Resolved: 13/Feb/13 10:48