[PHOENIX-1556] Base hash versus sort merge join decision on cost - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 4.14.0, 5.0.0
Component/s: None
Labels:
- CostBasedOptimization

Description

At compile time, we know how many guideposts (i.e. how many bytes) will be scanned for the RHS table. We should, by default, base the decision of using the hash-join verus many-to-many join on this information.

Another criteria (as we've seen in ~~PHOENIX-4508~~) is whether or not the tables being joined are already ordered by the join key. In that case, it's better to always use the sort merge join.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

PHOENIX-1556.patch
12/Feb/18 23:06
140 kB
Wei Xue

Activity

People

Assignee:: Wei Xue

Reporter:: James R. Taylor

Votes:: 1 Vote for this issue

Watchers:: 8 Start watching this issue

Dates

Created:: 23/Dec/14 23:44

Updated:: 26/Jul/18 01:14

Resolved:: 12/Feb/18 23:06