[CRUNCH-3] Replicated ("map-side") joins - ASF JIRA

Details

Type: New Feature
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 0.3.0
Component/s: MapReduce Patterns
Labels: None

Description

Replicated joins are a common way to improve performance when joining a large dataset with a small one. The smaller dataset is loaded into memory in each mapper or reducer task and is joined against the records of the larger dataset as the MapReduce job processes them.
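
As a rough illustration of the idea described above, here is a minimal sketch in plain Java. It is not the code from the attached patch and does not use the Crunch API; the class name `ReplicatedJoinSketch`, the methods `buildLookup` and `join`, and the sample records are all hypothetical. It assumes the small dataset fits in memory and that an inner join is wanted.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration of a replicated (map-side) join; not Crunch code.
public class ReplicatedJoinSketch {

    // Load the small dataset into an in-memory lookup table keyed by join key.
    // In a real job this would happen once per task, before any input is read.
    public static Map<String, List<String>> buildLookup(Iterable<String[]> smallRows) {
        Map<String, List<String>> lookup = new HashMap<>();
        for (String[] row : smallRows) {
            lookup.computeIfAbsent(row[0], k -> new ArrayList<>()).add(row[1]);
        }
        return lookup;
    }

    // Stream over the large dataset and emit one joined record per match.
    // Records whose key is absent from the lookup are dropped (inner join).
    public static List<String> join(Map<String, List<String>> lookup,
                                    Iterable<String[]> largeRows) {
        List<String> out = new ArrayList<>();
        for (String[] rec : largeRows) {
            List<String> matches = lookup.get(rec[0]);
            if (matches == null) {
                continue;
            }
            for (String match : matches) {
                out.add(rec[0] + "," + rec[1] + "," + match);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Made-up sample data: a small dimension table and a larger event stream.
        List<String[]> small = Arrays.asList(
                new String[] {"us", "United States"},
                new String[] {"nl", "Netherlands"});
        List<String[]> large = Arrays.asList(
                new String[] {"us", "click-1"},
                new String[] {"de", "click-2"},
                new String[] {"nl", "click-3"},
                new String[] {"us", "click-4"});

        Map<String, List<String>> lookup = buildLookup(small);
        for (String joined : join(lookup, large)) {
            System.out.println(joined);
        }
    }
}
```

Because every task holds its own copy of the small dataset, the large dataset can be joined in a single pass as it is read. The trade-off is memory: the approach only works while the small side fits in each task's heap.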

Attachments

mapside-joins.patch (29 kB, 07/Jul/12 05:58, Gabriel Reid)

People

Assignee: Gabriel Reid

Reporter: Josh Wills

Votes: 0

Watchers: 2

Dates

Created: 05/Jul/12 22:10

Updated: 17/Sep/12 06:41

Resolved: 07/Jul/12 05:59