[MAPREDUCE-4502] Node-level aggregation with combining the result of maps - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Patch Available
Priority: Major
Resolution: Unresolved
Affects Version/s: 3.0.0-alpha1
Fix Version/s: None
Component/s: applicationmaster, mrv2
Labels:
- BB2015-05-TBR

Description

The shuffle costs is expensive in Hadoop in spite of the existence of combiner, because the scope of combining is limited within only one MapTask. To solve this problem, it's a good way to aggregate the result of maps per node/rack by launch combiner.

This JIRA is to implement the multi-level aggregation infrastructure, including combining per container(MAPREDUCE-3902 is related), coordinating containers by application master without breaking fault tolerance of jobs.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

design_v2.pdf
26/Sep/12 01:31
347 kB
Tsuyoshi Ozawa
design_v3.pdf
17/Apr/13 11:18
440 kB
Tsuyoshi Ozawa
MAPREDUCE-4502.1.patch
20/Feb/13 03:35
113 kB
Tsuyoshi Ozawa
MAPREDUCE-4502.10.patch
17/Jul/13 15:46
109 kB
Tsuyoshi Ozawa
MAPREDUCE-4502.2.patch
21/Feb/13 01:36
115 kB
Tsuyoshi Ozawa
MAPREDUCE-4502.3.patch
21/Feb/13 07:18
115 kB
Tsuyoshi Ozawa
MAPREDUCE-4502.4.patch
21/Feb/13 10:40
115 kB
Tsuyoshi Ozawa
MAPREDUCE-4502.5.patch
25/Feb/13 07:03
115 kB
Tsuyoshi Ozawa
MAPREDUCE-4502.6.patch
25/Feb/13 08:55
116 kB
Tsuyoshi Ozawa
MAPREDUCE-4502.7.patch
26/Apr/13 17:01
117 kB
Tsuyoshi Ozawa
MAPREDUCE-4502.8.patch
30/Apr/13 06:33
117 kB
Tsuyoshi Ozawa
MAPREDUCE-4502.8.patch
29/Apr/13 22:48
117 kB
Tsuyoshi Ozawa
MAPREDUCE-4502.9.patch
17/Jul/13 10:29
109 kB
Tsuyoshi Ozawa
MAPREDUCE-4502.9.patch
17/Jul/13 10:03
109 kB
Tsuyoshi Ozawa
MAPREDUCE-4525-pof.diff
07/Dec/12 15:28
71 kB
Tsuyoshi Ozawa
speculative_draft.pdf
09/Sep/12 20:31
142 kB
Tsuyoshi Ozawa

Issue Links

is related to

TAJO-374 Investigate more efficient intermediate shuffle methods

Resolved

Sub-Tasks

1.	Combiner per node	Open	Tsuyoshi Ozawa
2.	Adding aggregationWaitMap for node-level combiner.	In Progress	Tsuyoshi Ozawa
3.	Adding new umbilical protocol RPC, "getAggregationTargets()", for node-level combiner.	In Progress	Tsuyoshi Ozawa
4.	Launching node-level combiner at the end stage of MapTask and ignoring aggregated inputs at ReduceTask	In Progress	Tsuyoshi Ozawa
5.	Adding AggregationWaitMap to some components(MRAppMaster, TaskAttemptListener, JobImpl, MapTaskImpl).	In Progress	Tsuyoshi Ozawa
6.	Add node-level aggregation flag feature(setNodeLevelAggregation(boolean)) to JobConf	Patch Available	Tsuyoshi Ozawa

Activity

People

Assignee:: Tsuyoshi Ozawa

Reporter:: Tsuyoshi Ozawa

Votes:: 0 Vote for this issue

Watchers:: 34 Start watching this issue

Dates

Created:: 01/Aug/12 06:46

Updated:: 12/May/16 18:23