Hadoop Common
  HADOOP-3702

add support for chaining Maps in a single Map and after a Reduce [M*/RM*]

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.19.0
    • Component/s: None
    • Labels: None
    • Environment: all
    • Hadoop Flags: Reviewed
    • Release Note:
      Introduced the ChainMapper and the ChainReducer classes to allow composing chains of Maps and Reduces in a single Map/Reduce job, something like MAP+ REDUCE MAP*.

      Description

      On the same input, we often need to run multiple Maps one after the other with no Reduce. We also need to run multiple Maps after the Reduce.

      If all pre-Reduce Maps are chained together and run as a single Map, a significant amount of disk I/O is avoided.

      Similarly, all post-Reduce Maps can be chained together and run in the Reduce phase after the Reduce.

      Attachments

      1. Hadoop-3702.patch
        60 kB
        Enis Soztutar
      2. patch3702.txt
        60 kB
        Alejandro Abdelnur
      3. patch3702.txt
        59 kB
        Alejandro Abdelnur
      4. patch3702.txt
        59 kB
        Alejandro Abdelnur
      5. patch3702.txt
        58 kB
        Alejandro Abdelnur
      6. patch3702.txt
        57 kB
        Alejandro Abdelnur
      7. patch3702.txt
        56 kB
        Alejandro Abdelnur
      8. patch3702.txt
        54 kB
        Alejandro Abdelnur
      9. patch3702.txt
        53 kB
        Alejandro Abdelnur
      10. patch3702.txt
        47 kB
        Alejandro Abdelnur
      11. patch3702.txt
        45 kB
        Alejandro Abdelnur
      12. patch3702.txt
        33 kB
        Alejandro Abdelnur
      13. patch3702.txt
        30 kB
        Alejandro Abdelnur
      14. patch3702.txt
        30 kB
        Alejandro Abdelnur

          Activity

          Alejandro Abdelnur created issue -
          Alejandro Abdelnur added a comment - edited

          This could be done with ChainMapper and ChainReducer classes that would manage the chain of Maps and would override the OutputCollector to implement the chaining.

          The Maps and Reduce that are part of the chain are unaware that they are executed in a chain; they receive records via the map and reduce methods and emit output via the OutputCollector.

          The API would look something like:

          
          public class ChainMapper implements Mapper {
          
            public static void addMapper(JobConf job, Class<? extends Mapper> klass, Properties mapperConf);
            ...
          }
          
          public class ChainReducer implements Reducer {
          
            public static void setReducer(JobConf job, Class<? extends Reducer> klass, Properties reducerConf);
          
            public static void addMapper(JobConf job, Class<? extends Mapper> klass, Properties mapperConf);
            ...
          }
          
          

          The Properties configuration passed to the Mapper and Reducer when setting them into the chain is injected into a copy of the job's configuration. This allows maps to be configured as usual without being aware that they are in a chain.

          Example of creating and submitting a chain job:

          
          JobConf conf = new JobConf();
          
          // chaining maps in the Map phase
          
          Properties mapAConf = new Properties();
          mapAConf.setProperty("a", "A");
          ChainMapper.addMapper(conf, AMap.class, mapAConf);
          
          ChainMapper.addMapper(conf, BMap.class, null);
          
          // setting the reducer
          
          Properties reduceConf = new Properties();
          ChainReducer.setReducer(conf, XReduce.class, reduceConf);
          
          // chaining maps in the Reduce phase
          
          ChainReducer.addMapper(conf, CMap.class, null);
          
          ChainReducer.addMapper(conf, DMap.class, null);
          
          ...
          
          FileInputFormat.setInputPaths(conf, inDir);
          FileOutputFormat.setOutputPath(conf, outDir);
          
          JobClient jc = new JobClient(conf);
          RunningJob job = jc.submitJob(conf);
          
          
          Alejandro Abdelnur made changes -
          Field Original Value New Value
          Description [edited: the API proposal text was moved out of the description into the first comment above, leaving only the motivation paragraphs]
          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12385705 ]
          Alejandro Abdelnur added a comment -

          Proposed implementation. Unlike the initial proposal in the comment above, it uses a JobConf to configure the Maps in the chain.

          Alejandro Abdelnur made changes -
          Release Note The ChainMapper and the ChainReducer classes allow composing chains of Maps and Reduces in a single Map/Reduce job, something like MAP+ / REDUCE MAP*.
          An immediate benefit of this pattern is a reduction in disk I/O as many Maps can be clubbed together in a single job.
          Status Open [ 1 ] Patch Available [ 10002 ]
          Chris Douglas added a comment -

          Very basic question: what are the semantics of a mapper in this chain calling collect(K,V)? Currently, it is guaranteed that neither the key nor the value will be modified, so the following must hold:

          key.set(some_value);
          value.set(some_other_value);
          collect(key, value);
          assert key.get().equals(some_value);
          assert value.get().equals(some_other_value);
          

          Chaining mappers can violate this property unless the following maps guarantee (by convention, presumably) that they will not modify either argument. It might make sense to require chained mappers (excluding the final mapper) to implement a different interface, even if that interface is empty, to promise to treat the record as const.
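
          To make the hazard concrete, here is a minimal hypothetical sketch (the class name is invented for illustration) of a downstream chained mapper that mutates the key it receives; if records are passed by reference, this silently breaks the upstream mapper's assumption:

          import java.io.IOException;

          import org.apache.hadoop.io.Text;
          import org.apache.hadoop.mapred.MapReduceBase;
          import org.apache.hadoop.mapred.Mapper;
          import org.apache.hadoop.mapred.OutputCollector;
          import org.apache.hadoop.mapred.Reporter;

          // Hypothetical downstream mapper in a chain. key.set(...) below also
          // mutates the object still held by the upstream mapper, violating the
          // collect(K,V) guarantee quoted above.
          public class UppercaseKeyMap extends MapReduceBase
              implements Mapper<Text, Text, Text, Text> {

            public void map(Text key, Text value,
                OutputCollector<Text, Text> output, Reporter reporter)
                throws IOException {
              key.set(key.toString().toUpperCase()); // mutates the shared instance
              output.collect(key, value);
            }
          }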

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12385705/patch3702.txt
          against trunk revision 676069.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 3 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          -1 findbugs. The patch appears to cause Findbugs to fail.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed core unit tests.

          -1 contrib tests. The patch failed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2842/testReport/
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2842/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2842/console

          This message is automatically generated.

          Runping Qi added a comment -

          I don't understand why the map chaining is necessary.
          Why not just composite all the functions into one and use the composite as the mapper or reducer?

          Alejandro Abdelnur added a comment -
          • On Chris' comment:

          You are right; I missed that point in my proposed implementation.

          To address it, the Chain code should take care of cloning the key and value before passing them to the following Map in the chain.

          Still, as an optimization (to avoid serializing/deserializing keys and values), a passByReference property could be set for every link in the chain.

          • On Runping's comment:

          In our current implementation we do as you suggest: we have our own set of interfaces for processing (not Mappers and Reducers), and a ChainMapper and a ChainReducer that manage the lifecycle of those private interfaces.

          Using the Mapper/Reducer interfaces directly is cleaner, more consistent, and simpler for developers.

          Alejandro Abdelnur added a comment -

          Fixing test and style errors.

          Still not addressing by-value semantics on the chain.

          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12385952 ]
          Alejandro Abdelnur made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12385952 ]
          Alejandro Abdelnur added a comment -

          Fixing test and style errors.

          Still missing: pass-by-value semantics.

          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12385953 ]
          Alejandro Abdelnur made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12385953/patch3702.txt
          against trunk revision 676069.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 3 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2854/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2854/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2854/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2854/console

          This message is automatically generated.

          Alejandro Abdelnur made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Alejandro Abdelnur added a comment -

          Refactoring the code to add pass-by-value support in the chain.

          Everything is in place except the ser/deser implementation, the ChainOutputCollector.passByValue() method (coming soon).

          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12385965 ]
          Alejandro Abdelnur added a comment -

          Another thing missing in the current patch is specifying the key/value in/out classes for the maps and reduce in the chain and making sure they are compatible between the elements of the chain.

          Alejandro Abdelnur added a comment -

          Added byValue support to preserve the semantics of the collector not modifying the key and values.

          byReference can still be used as an optimization if the mappers and reducer in the chain don't rely on the byValue semantics.

          Improved the patch to allow different key/value classes to be used by different elements in the chain; the only restriction is that the output of a mapper has to match the expected input of the next mapper.

          Sample usage:

          
            JobConf conf = new JobConf();
            conf.setJobName("chain");
            
            conf.setInputFormat(TextInputFormat.class);
            conf.setOutputFormat(TextOutputFormat.class);
            
            FileInputFormat.setInputPaths(conf, inDir);
            FileOutputFormat.setOutputPath(conf, outDir);
          
            JobConf mapAConf = new JobConf();
            ChainMapper.addMapper(conf, AMap.class, LongWritable.class, Text.class, Text.class, Text.class, true, mapAConf);
           
            JobConf mapBConf = new JobConf();
            ChainMapper.addMapper(conf, BMap.class, Text.class, Text.class, LongWritable.class, Text.class, false, mapBConf);
           
            JobConf reduceConf = new JobConf();
            ChainReducer.setReducer(conf, XReduce.class, LongWritable.class, Text.class, Text.class, Text.class, true, reduceConf);
           
            ChainReducer.addMapper(conf, CMap.class, Text.class, Text.class, LongWritable.class, Text.class, false, null);
           
            ChainReducer.addMapper(conf, DMap.class, LongWritable.class, Text.class, LongWritable.class, LongWritable.class, true, null);
             ...
           
            JobClient jc = new JobClient(conf);
            RunningJob job = jc.submitJob(conf);
          

          Note: the second-to-last boolean parameter indicates whether the Mapper/Reducer added to the chain wants byValue (true) or byReference (false) semantics.
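
          For illustration, the by-value copy could be implemented roughly as in the sketch below, using Writable serialization; the helper name and placement are assumptions, not the patch's actual code:

          import java.io.IOException;

          import org.apache.hadoop.io.DataInputBuffer;
          import org.apache.hadoop.io.DataOutputBuffer;
          import org.apache.hadoop.io.Writable;
          import org.apache.hadoop.util.ReflectionUtils;

          // Sketch: clone a Writable by serializing it into a buffer and reading
          // it back into a fresh instance, so a downstream mapper can mutate its
          // copy without affecting the caller's object.
          public class WritableCloner {

            public static <T extends Writable> T cloneByValue(T obj) throws IOException {
              DataOutputBuffer out = new DataOutputBuffer();
              obj.write(out);
              DataInputBuffer in = new DataInputBuffer();
              in.reset(out.getData(), out.getLength());
              @SuppressWarnings("unchecked")
              T copy = (T) ReflectionUtils.newInstance(obj.getClass(), null);
              copy.readFields(in);
              return copy;
            }
          }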

          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12386160 ]
          Alejandro Abdelnur made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Alejandro Abdelnur made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Alejandro Abdelnur added a comment -

          Using JobConf(false) for the chain mappers' and reducer's JobConfs when created internally, to reduce the size of the chain job's JobConf.

          Reusing the ByteArrayOutputStream used for byValue passing, with a ThreadLocal so it works with the MultithreadedMapRunner.
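
          A rough sketch of that per-thread reuse pattern (class and field names invented for illustration): each thread gets its own serialization buffer, so concurrent maps never share one.

          import org.apache.hadoop.io.DataOutputBuffer;

          // Sketch: one buffer per thread, reset before each use, so the
          // MultithreadedMapRunner's worker threads never clobber each other.
          public class PerThreadBuffers {

            private static final ThreadLocal<DataOutputBuffer> BUFFER =
                new ThreadLocal<DataOutputBuffer>() {
                  protected DataOutputBuffer initialValue() {
                    return new DataOutputBuffer();
                  }
                };

            public static DataOutputBuffer get() {
              DataOutputBuffer buffer = BUFFER.get();
              buffer.reset(); // reuse the backing array, discard old contents
              return buffer;
            }
          }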

          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12386208 ]
          Alejandro Abdelnur made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Release Note The ChainMapper and the ChainReducer classes allow composing chains of Maps and Reduces in a single Map/Reduce job, something like MAP+ / REDUCE MAP*. An immediate benefit of this pattern is a reduction in disk I/O as many Maps can be clubbed together in a single job.
          Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12386208/patch3702.txt
          against trunk revision 677379.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 3 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2880/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2880/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2880/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2880/console

          This message is automatically generated.

          Owen O'Malley added a comment -

          I don't like that each stage in the pipeline has its own configuration and that you serialize them all and put them into the outer configuration. What is the use case for that? If you are going to do that, wouldn't it be easier to just make Configuration Writable and use the writable stringifier?

          You should use hadoop.io.Data{In,Out}putBuffer rather than defining your own DirectBufferByteArrayOutputStream, especially since the name sounds like a direct buffer.

          You should probably convert the task id string to a TaskAttemptID and call the isMap method rather than parsing the taskid string.

          The preferred style is Sun's:

          if (...) {
          } else {
          }
          

          Other than that, it seems good.
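
          To illustrate the TaskAttemptID suggestion above, a minimal sketch (obtaining the id via the "mapred.task.id" property is an assumption about how the patch would look it up):

          import org.apache.hadoop.mapred.JobConf;
          import org.apache.hadoop.mapred.TaskAttemptID;

          // Sketch: recover the task attempt id from the configuration and ask
          // it whether this is a map task, instead of parsing the raw string.
          public class TaskKind {

            public static boolean isMapTask(JobConf conf) {
              TaskAttemptID taskId = TaskAttemptID.forName(conf.get("mapred.task.id"));
              return taskId.isMap();
            }
          }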

          Alejandro Abdelnur added a comment -

          Thanks for the feedback.

          On "each stage in the pipeline having its own configuration .. put them into the outer configuration":

          The use case is that the mappers/reducer in the chain (I'm not using 'pipe' to avoid confusion, as it has another use in Hadoop) are not aware of each other or that they are in a chain, and they may have collisions in the configuration keys they use.

          On "If you are going to do that, ... make Configuration Writable and use the writable stringifier?":

          That was my initial idea, but it would have required changes in Configuration and I was avoiding any change in the core.

          I'll make this change.

          *On "use {{hadoop.io.Data

          {In,Out}

          putBuffer}} ...":*

          I'll look at it.

          On "You should probably convert the task id string to a TaskAttemptID ...":

          OK

          On "preferred sytle is Sun's":

          OK

          Alejandro Abdelnur added a comment -

          Integrating Owen's comments:

          • Modified Configuration to be a Writable
          • Modified WritableUtils to have methods to convert Writable to/from String
          • Modified Chain to use the above methods instead of the hack with Properties
          • Using hadoop DataOutputBuffer
          • Using Sun's if/else style

          I could not figure out the comment on TaskAttemptID, as the patch is not using the task id at all to find out whether it is a map or reduce task. The ChainMapper and ChainReducer create a Chain to delegate common logic to, and at creation time they indicate what it is used for.

          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12386712 ]
          Alejandro Abdelnur made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12386712 ]
          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12386713 ]
          Alejandro Abdelnur made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Alejandro Abdelnur made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Alejandro Abdelnur added a comment -

          Inverted precedence of configuration.

          The chain element JobConf configuration has precedence over the chain job JobConf.

          This allows setting defaults at the chain job JobConf level and overriding them at the chain element level.

          Note that this does not break anything at runtime, as all the injected configuration for the task is used by the Chain{Mapper,Reducer}.
          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12386720 ]
          Alejandro Abdelnur made changes -
          Assignee Alejandro Abdelnur [ tucu00 ] Christophe Taton [ kryzthov ]
          Status Open [ 1 ] Patch Available [ 10002 ]
          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12386720/patch3702.txt
          against trunk revision 679202.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 9 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          -1 findbugs. The patch appears to cause Findbugs to fail.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed core unit tests.

          -1 contrib tests. The patch failed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2930/testReport/
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2930/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2930/console

          This message is automatically generated.

          Alejandro Abdelnur added a comment -

          There is an ambiguity with Configuration.write due to making Configuration implement Writable.

          The existing Configuration.write(OutputStream) that writes out XML must be renamed to something like writeXml(OutputStream).

          Alejandro Abdelnur made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Alejandro Abdelnur made changes -
          Assignee Christophe Taton [ kryzthov ] Alejandro Abdelnur [ tucu00 ]
          Alejandro Abdelnur added a comment -

          Renaming the original Configuration.write(OutputStream) to Configuration.writeXml(OutputStream) to avoid ambiguity with the write(DataOutput) introduced by implementing Writable. The ambiguity occurs when using a DataOutputStream (this happens in JobHistory, JobClient, TaskTracker, TaskRunner).

          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12386782 ]
          Alejandro Abdelnur made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12386782/patch3702.txt
          against trunk revision 679506.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 9 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed core unit tests.

          -1 contrib tests. The patch failed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2941/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2941/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2941/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2941/console

          This message is automatically generated.

          Alejandro Abdelnur added a comment -

          TestPipes is failing due to a signature change in Configuration, from write(OutputStream) to writeXml(OutputStream).

          Alejandro Abdelnur made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Alejandro Abdelnur added a comment -

          Fixing TestPipes test case.

          By making Configuration implement Writable, there is an ambiguity between the write(OutputStream) that already existed in Configuration and the write(DataOutput) that is brought in by implementing Writable. The ambiguity arises when using a DataOutputStream; to resolve this I've renamed the original write(OutputStream) to writeXml(OutputStream).
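
          To see why the call is ambiguous, here is a minimal self-contained sketch (a hypothetical stand-in class, not Configuration itself): DataOutputStream both extends OutputStream and implements DataOutput, so neither overload is more specific and the compiler rejects the call.

          import java.io.DataOutput;
          import java.io.DataOutputStream;
          import java.io.IOException;
          import java.io.OutputStream;

          // Minimal stand-in with the two overloads Configuration would have.
          class Conf {
            public void write(OutputStream out) throws IOException { /* XML form */ }
            public void write(DataOutput out) throws IOException { /* Writable form */ }
            public void writeXml(OutputStream out) throws IOException { /* XML form, renamed */ }

            public static void main(String[] args) throws IOException {
              DataOutputStream dos = new DataOutputStream(System.out);
              // new Conf().write(dos);  // does not compile: reference to write is ambiguous
              new Conf().writeXml(dos);  // renaming one overload resolves it
            }
          }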

          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12386850 ]
          Alejandro Abdelnur made changes -
          Release Note The ChainMapper and the ChainReducer classes allow composing chains of Maps and Reduces in a single Map/Reduce job, something like MAP+ / REDUCE MAP*. An immediate benefit of this pattern is a reduction in disk I/O as many Maps can be clubbed together in a single job.

          The Configuration write(OutputStream) method has been renamed to writeXml(OutputStream) to avoid ambiguity with the Writable write(DataOutput) method.
          Hadoop Flags [Incompatible change]
          Alejandro Abdelnur made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12386850/patch3702.txt
          against trunk revision 679772.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 12 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed core unit tests.

          -1 contrib tests. The patch failed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2951/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2951/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2951/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2951/console

          This message is automatically generated.

          Chris Douglas added a comment -
          • Instead of adding WritableUtils::asString, it might make more sense to use the o.a.h.io.Stringifier interfaces, particularly since you create a new JobConf (permitting a user to pass in an object would be an excellent, but not strictly necessary, addition to Stringifier, too)
          • I'm not sure I understand how the configuration deserialization works. In Chain::getChainElementConf, a new config is created, its fields cleared and populated from the stream, then each property defined in the deserialized conf is (re)defined on a clone of the JobConf passed in (presumably to permit final, etc. to be observed). Is that accurate?
          • Is it true that each call to map creates a series of ChainOutputCollectors? These look like lightweight objects, but is there a reason the pipeline needs to be recreated each time?
          • It looks like this broke the eclipse plugin:

            [exec] compile:
            [exec] [echo] contrib: eclipse-plugin
            [exec] [javac] Compiling 30 source files to /zonestorage/hudson/home/hudson/hudson/jobs/Hadoop-Patch/workspace/trunk/build/contrib/eclipse-plugin/classes
            [exec] [javac] /zonestorage/hudson/home/hudson/hudson/jobs/Hadoop-Patch/workspace/trunk/src/contrib/eclipse-plugin/src/java/org/apache/hadoop/eclipse/server/HadoopServer.java:422: write(java.io.DataOutput) in org.apache.hadoop.conf.Configuration cannot be applied to (java.io.FileOutputStream)
            [exec] [javac] this.conf.write(fos);
            [exec] [javac] ^
            [exec] [javac] /zonestorage/hudson/home/hudson/hudson/jobs/Hadoop-Patch/workspace/trunk/src/contrib/eclipse-plugin/src/java/org/apache/hadoop/eclipse/servers/RunOnHadoopWizard.java:166: write(java.io.DataOutput) in org.apache.hadoop.conf.Configuration cannot be applied to (java.io.FileOutputStream)
            [exec] [javac] conf.write(fos);
            [exec] [javac] ^

          Alejandro Abdelnur added a comment -

          On the first bullet item: finally got the Stringifier thing, thanks.

          On the second bullet item: the serialized chain element JobConf (A) is deserialized; a new JobConf (B) is created with the task's JobConf as parent (to ensure all Hadoop runtime values are available); then the values of A are applied to B, so the chain element's config values take precedence when the chain mapper/reducer gets its conf, B.

          On the third bullet item: yes, you are right. Two reasons: one is to keep the code simpler; the second is that if a MultithreadedMapRunner is used, a single pipeline will not work as it is not thread safe.

          On the fourth bullet item: this will go away as I'm using Stringifier in the next version of the patch.
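
          A hedged sketch of the A/B precedence step just described (the method name echoes Chris's question above; the body is an illustration of the described behavior, not the patch's code, and it assumes a Configuration serialization is registered for DefaultStringifier):

          import java.io.IOException;
          import java.util.Map;

          import org.apache.hadoop.io.DefaultStringifier;
          import org.apache.hadoop.mapred.JobConf;

          public class ChainConfs {

            // "A" is the chain element conf serialized into the job at submit time;
            // "B" starts as a copy of the task conf so runtime values are present,
            // then A's entries are applied so the element's values take precedence.
            public static JobConf getChainElementConf(JobConf taskConf, String confKey)
                throws IOException {
              JobConf a = DefaultStringifier.load(taskConf, confKey, JobConf.class);
              JobConf b = new JobConf(taskConf);
              for (Map.Entry<String, String> entry : a) {
                b.set(entry.getKey(), entry.getValue());
              }
              return b;
            }
          }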

          Alejandro Abdelnur made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12387102 ]
          Alejandro Abdelnur made changes -
          Hadoop Flags [Incompatible change]
          Release Note [removed the sentence about the Configuration write(OutputStream) rename] The ChainMapper and the ChainReducer classes allow composing chains of Maps and Reduces in a single Map/Reduce job, something like MAP+ / REDUCE MAP*. An immediate benefit of this pattern is a reduction in disk I/O as many Maps can be clubbed together in a single job.
          Alejandro Abdelnur added a comment -

          Modified to use Stringifier as suggested; added a Serialization implementation for Configuration for that.

          This removes the 'incompatible change' introduced in the previous patch.
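
          For illustration, storing and loading a chain element's JobConf through the Stringifier mechanism might look like the sketch below; the key name is invented, and it assumes the patch's Serialization for Configuration is registered under io.serializations:

          import java.io.IOException;

          import org.apache.hadoop.io.DefaultStringifier;
          import org.apache.hadoop.mapred.JobConf;

          public class StoreElementConf {

            public static void main(String[] args) throws IOException {
              JobConf chainJob = new JobConf();
              JobConf elementConf = new JobConf(false); // small conf, no default resources
              elementConf.set("a", "A");

              // Serialize the element conf into a string property of the chain job...
              DefaultStringifier.store(chainJob, elementConf, "chain.mapper.0.conf");

              // ...and read it back at task time.
              JobConf restored =
                  DefaultStringifier.load(chainJob, "chain.mapper.0.conf", JobConf.class);
              System.out.println(restored.get("a")); // prints A
            }
          }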

          Alejandro Abdelnur made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12387102/patch3702.txt
          against trunk revision 680577.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 9 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2972/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2972/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2972/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/2972/console

          This message is automatically generated.

          Enis Soztutar added a comment -
          1. Having a distinct serializer for Configuration is not desired. Configuration is already marshaled/unmarshaled to/from string form as XML. So having Configuration implement Writable is the choice here. There is no ambiguity in the Configuration.write() method. We can keep the current write(OutputStream out) method and add:
            public void write(final DataOutput out) throws IOException {
                write(new OutputStream() {
                  @Override
                  public void write(int b) throws IOException {
                    out.writeByte(b);
                  }
                });
              }
            

            The readFields() can be implemented by factoring common functionality of loadProperties() method, instead of reading from a URL or file, the XML will be built using DataInput.

2. It would be better for ChainMapper, ChainReducer, and ChainOutputCollector to use generics, similar to the Mapper, Reducer, and OutputCollector classes.
3. Creating a new ChainOutputCollector() at each collect() call might be suboptimal. Could this not be done once for each Map/Reduce task?
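
A minimal sketch of the readFields() idea from point 1. This assumes the serialized XML bytes are length-prefixed (the write() sketch above writes raw bytes, so the two sides would need to agree on framing) and assumes a hypothetical helper loadResource(InputStream), factored out of loadProperties(), that parses configuration XML from a stream:

  public void readFields(DataInput in) throws IOException {
    // Assumes the writer length-prefixed the XML bytes.
    byte[] xml = new byte[in.readInt()];
    in.readFully(xml);
    // loadResource(...) is a hypothetical helper factored out of
    // loadProperties(); it parses configuration XML from the stream.
    loadResource(new ByteArrayInputStream(xml));
  }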
          Enis Soztutar made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Enis Soztutar made changes -
          Priority Minor [ 4 ] Major [ 3 ]
          Alejandro Abdelnur added a comment -

On #1, the ambiguity between the existing write(OutputStream) and the write(DataOutput) introduced by the Writable interface arises when a DataOutputStream object is passed to the write(...) method: the compiler cannot resolve which overload to use. (I did this in one of the previous patches; my solution was changing the existing write() to writeXml(), but that started breaking in different places in contrib, as the method is used outside of core.)
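
For illustration, a minimal standalone example of the overload ambiguity (Conf is a hypothetical stand-in for Configuration). DataOutputStream extends OutputStream and implements DataOutput, and since the two parameter types are unrelated, neither overload is more specific:

  import java.io.*;

  class Conf {
    public void write(OutputStream out) throws IOException { /* existing XML path */ }
    public void write(DataOutput out) throws IOException { /* Writable path */ }
  }

  class Caller {
    void demo(Conf conf, DataOutputStream dos) throws IOException {
      // conf.write(dos);             // does not compile: "reference to write is ambiguous"
      conf.write((OutputStream) dos); // an explicit cast resolves the ambiguity
    }
  }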

On #2, it doesn't make much sense to generify the Chain* classes as they are not directly exposed to the M/R developer; they are artifacts used to enable chaining, and the M/R developer doesn't code against them. If they were generified it would not add any value, as nothing could be checked for them at compile time.

On #3, ChainOutputCollector is a lightweight class; see my answer to Chris regarding a similar question.

          Enis Soztutar added a comment -
1. Obviously, casting the DataOutputStream to OutputStream would resolve the ambiguity. All such references could be changed (if any) to first cast the reference:

  DataOutputStream dos = ...
  ...
  conf.write((OutputStream) dos);
  ...

2. Although the Chain classes are not exposed, it is always good practice to use generics. We can get rid of the @SuppressWarnings("unchecked") annotations.

3. If you are confident that this does not introduce extra overhead, I'm fine with this.
          Alejandro Abdelnur added a comment -

On #1, yes, but this introduces a backwards incompatibility, as there is code out there (outside of core) that uses write(OutputStream) with DataOutputStream instances, and the change would break such usages (as the previous patch found).

On #2, I'm missing something here: under what circumstances would it make sense to use the Chain* classes with generics that would be checked at compile time? If it is just a way of avoiding the @SuppressWarnings annotations, I'd prefer the annotations as, IMO, they are meant for these cases.

          Enis Soztutar added a comment -

On #1, yes, but this introduces a backwards incompatibility, as there is code out there (outside of core) that uses write(OutputStream) with DataOutputStream instances, and the change would break such usages (as the previous patch found).

Yes, at some level it will introduce backwards incompatibility. But thinking in abstract terms, having func(Interface1 i1) and introducing func(Interface2 i2) is not a direct incompatibility. Only those calls where the object implements both Interface1 and Interface2 would be affected (in this case DataOutputStream), and there is a clear workaround for this. I think introducing a Serialization for Configuration is far worse. I suggest we keep write(OutputStream), introduce write(DataOutput), fix all the cases where a DataOutputStream is passed (including contrib), and mark this change as incompatible with clear documentation in the release note.

On #2, I'm missing something here: under what circumstances would it make sense to use the Chain* classes with generics that would be checked at compile time? If it is just a way of avoiding the @SuppressWarnings annotations, I'd prefer the annotations as, IMO, they are meant for these cases.

@SuppressWarnings annotations are just "hacks" to make the compiler stop complaining. There are valid reasons for the compiler to issue warnings, and instead of fixing them we tell the compiler to ignore them, which is not desirable.
The motivation to use generics is the same as the one for using generics in Mapper, Reducer, etc. I guess with a little extra effort we could make this change, no?

          Alejandro Abdelnur added a comment -

On #1, I don't have a problem either way; I was just trying to be as minimally disruptive as possible. Somebody has to make the call here.

On #2, I've tried to generify ChainOutputCollector but I'm hitting a wall there: I never have the types to instantiate it, so I keep getting the compiler warnings. Thoughts?

          Alejandro Abdelnur added a comment -

          Enis,

          I have not heard anything back from you on this.

          Devaraj Das added a comment -

I think introducing a Serialization for Configuration is far worse.

Enis, could you please expand on this one? It is not clear to me how this is worse.

Regarding the generics signature for the Chain* classes, I think it is okay not to have generics, since these classes' key/value types are driven by the key/value types of the first/last map/reduce in the chain.

          Enis Soztutar added a comment -

          I am attaching a patch which is a modified version of the last one.

I have added generic arguments where applicable. But not all the instances could be generified, since we cannot know the type arguments: we do not know the map input and output types after the first map and before the last map. At least ChainReducer and ChainMapper are now generified.

Implementing a Serializer for a specific class does not fit into the design of Serializer. Serializer is intended to be defined once for each set of classes (those implementing Writable, Serializable, etc.).
In the attached patch, WritableJobConf extends JobConf and implements Writable. Chain uses this class to serialize/deserialize the configuration. And this does not introduce an incompatibility, since Configuration/JobConf does not implement Writable.
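
A minimal sketch of what such a WritableJobConf could look like, reusing the existing Configuration.write(OutputStream) XML serialization and assuming addResource(InputStream) can load configuration XML from a stream; the actual patch may differ:

  public class WritableJobConf extends JobConf implements Writable {

    public void write(DataOutput out) throws IOException {
      // Reuse the existing XML serialization, length-prefixing the bytes
      // so readFields() knows where this configuration ends.
      ByteArrayOutputStream baos = new ByteArrayOutputStream();
      write(baos);
      byte[] xml = baos.toByteArray();
      out.writeInt(xml.length);
      out.write(xml);
    }

    public void readFields(DataInput in) throws IOException {
      byte[] xml = new byte[in.readInt()];
      in.readFully(xml);
      addResource(new ByteArrayInputStream(xml));
    }
  }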

          Enis Soztutar made changes -
          Attachment Hadoop-3702.patch [ 12388256 ]
          Enis Soztutar made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Enis Soztutar made changes -
          Link This issue incorporates HADOOP-3927 [ HADOOP-3927 ]
          Alejandro Abdelnur added a comment -

          [apologies for the delay following up on this, I was off all last week]

          On using generics

          Enis, I don't think the use of generics in your proposed patch is correct. Let me try to explain.

          First reason:

          The intended normal use of ChainMapper and ChainReducer is via configuration, i.e.:

  JobConf conf = ...
  ChainMapper.addMapper(conf, ...);
  ChainMapper.addMapper(conf, ...);
  ChainMapper.addMapper(conf, ...);
  ...

Making ChainMapper<K1, V1, K2, V2> does not make sense in this case, as a developer never instantiates it. Thus no type checking is done at compile time for K1, V1, K2, V2.

          Second reason:

Even if you were to create an instance of ChainMapper bound to concrete classes for K1, V1, K2, V2, it would not work for the common case of mappers in the chain using different key/value classes.

See, the contract between the maps in a chain is that the input key/value classes of the first mapper are the same as the input key/value classes of the job, the input key/value classes of the second mapper are the same as the output key/value classes of the first mapper, and so on, and the output key/value classes of the last mapper in the chain (for the ChainMapper) are the same as the input key/value classes of the reducer.

For example, take a job whose map input/output classes are K1, V1, K2, V2; you can have the following chain:

  JobConf conf = ...
  ChainMapper.addMapper(conf, AMap.class, K1.class, V1.class, Ka.class, Va.class, null);
  ChainMapper.addMapper(conf, BMap.class, Ka.class, Va.class, Kb.class, Vb.class, null);
  ChainMapper.addMapper(conf, CMap.class, Kb.class, Vb.class, K2.class, V2.class, null);

          On using a Serializer for Configuration

Note that the Serializer is for Configuration and its subclasses; it is not bound to Configuration alone.

          I'm OK with your proposed patch here.

          Alejandro Abdelnur made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Alejandro Abdelnur added a comment -

This patch uses Enis' WritableJobConf instead of making a Serialization for Configuration.

          But it does not generify the Chain classes (per previous comment).

          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12389067 ]
          Alejandro Abdelnur made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Enis Soztutar added a comment -

Making ChainMapper<K1, V1, K2, V2> does not make sense in this case, as a developer never instantiates it. Thus no type checking is done at compile time for K1, V1, K2, V2.

It does not matter whether the classes are instantiated by the user or not; it is better to have them use generics properly as much as we can, so that we can get rid of the @SuppressWarnings("unchecked"). IdentityMapper is also never instantiated by the user.

Even if you were to create an instance of ChainMapper bound to concrete classes for K1, V1, K2, V2, it would not work for the common case of mappers in the chain using different key/value classes. See, the contract between the maps in a chain is that the input key/value classes of the first mapper are the same as the input key/value classes of the job, the input key/value classes of the second mapper are the same as the output key/value classes of the first mapper, and so on, and the output key/value classes of the last mapper in the chain (for the ChainMapper) are the same as the input key/value classes of the reducer.

Yes, I know that the whole dataflow would be

  (K1, V1) -> (Ka, Va) -> ... -> (K2, V2) -> (K3, V3)

where the first mapper takes K1, V1 and the last mapper outputs K2, V2, and the patch checks for that.
For example, Chain#mappers being Mapper<?,?,?,?> does not enforce that the mappers use the same input/output classes, and ChainOutputCollector uses <K,V,KO,VO> so that its input and output classes are enforced (but unfortunately we cannot instantiate ChainOutputCollectors with bound parameters).
Chain#addMapper() is static, so its generic <K1,V1,K2,V2> parameters are not bound by the Chain<K1,...V3> parameters. So you can again use

  JobConf conf = ...
  ChainMapper.addMapper(conf, AMap.class, K1.class, V1.class, Ka.class, Va.class, null);
  ChainMapper.addMapper(conf, BMap.class, Ka.class, Va.class, Kb.class, Vb.class, null);
  ChainMapper.addMapper(conf, CMap.class, Kb.class, Vb.class, K2.class, V2.class, null);

My version of the patch uses a similar approach to JobConf, in that we try to use generics as much as possible, which, I think, should be the preferred way.

          Hadoop QA added a comment -

          -1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12389067/patch3702.txt
          against trunk revision 689733.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 9 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          -1 core tests. The patch failed core unit tests.

          -1 contrib tests. The patch failed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3136/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3136/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3136/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3136/console

          This message is automatically generated.

          Alejandro Abdelnur added a comment -

The failed tests seem unrelated to this patch.

          Alejandro Abdelnur made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Alejandro Abdelnur added a comment -

I've got Enis' point about making use of generics in the addMapper and setReducer methods of the Chain* classes. This enforces a compile-time type check between the defined key/value in/out classes and the mapper/reducer classes when defining the chain. I've modified the patch to do so.

          An additional question on this. Currently the addMapper and setReducer signatures are like:

  public static <K1, V1, K2, V2> void addMapper(boolean isMap, JobConf jobConf,
                           Class<? extends Mapper<K1, V1, K2, V2>> klass,
                           Class<K1> inputKeyClass, Class<V1> inputValueClass,
                           Class<K2> outputKeyClass, Class<V2> outputValueClass,
                           boolean byValue, JobConf mapperConf) {
          

Wouldn't it be more appropriate to make them:

  public static <K1, V1, K2, V2> void addMapper(boolean isMap, JobConf jobConf,
                           Class<? extends Mapper<? extends K1, ? extends V1, ? extends K2, ? extends V2>> klass,
                           Class<K1> inputKeyClass, Class<V1> inputValueClass,
                           Class<K2> outputKeyClass, Class<V2> outputValueClass,
                           boolean byValue, JobConf mapperConf) {
          

Still, I don't agree on making the Chain* classes generic. Their generics would not be used for any type checking at compilation, and it is not possible to get rid of the @SuppressWarnings("unchecked").
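
For illustration, a sketch of why the unchecked cast seems unavoidable when the chain instantiates mappers from configured Class objects (newMapper is a hypothetical helper, not the patch's code):

  // The concrete key/value types of each mapper in the chain are only
  // known at runtime (from the JobConf), so the cast cannot be checked
  // at compile time.
  @SuppressWarnings("unchecked")
  static Mapper<Object, Object, Object, Object> newMapper(
      JobConf conf, Class<? extends Mapper> klass) {
    return (Mapper<Object, Object, Object, Object>)
        ReflectionUtils.newInstance(klass, conf);
  }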

          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12389247 ]
          Alejandro Abdelnur made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Fix Version/s 0.19.0 [ 12313211 ]
          Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12389247/patch3702.txt
          against trunk revision 691055.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 9 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 release audit. The applied patch does not increase the total number of release audit warnings.

          +1 core tests. The patch passed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3152/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3152/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3152/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3152/console

          This message is automatically generated.

          Enis Soztutar added a comment -

Still, I don't agree on making the Chain* classes generic. Their generics would not be used for any type checking at compilation, and it is not possible to get rid of the @SuppressWarnings("unchecked").

          I do not wish to further delay the patch for this. I'm OK with the current patch, although the reasons for using generics remain.

Wouldn't it be more appropriate to make them?...

Sigh. Yes, since in generics, G<Foo> is not a subtype of G<Bar> even if Foo extends Bar. So please make that change.
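
For illustration, a minimal standalone example of this invariance (Foo and Bar are hypothetical classes):

  import java.util.ArrayList;
  import java.util.List;

  class Bar { }
  class Foo extends Bar { }

  class Demo {
    void invariance() {
      List<Foo> foos = new ArrayList<Foo>();
      // List<Bar> bars = foos;        // does not compile: List<Foo> is not a List<Bar>
      List<? extends Bar> bars = foos; // the wildcard accepts a List of any Bar subtype
    }
  }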

          Alejandro Abdelnur added a comment -

          Thanks Enis,

On the use of generics in addMapper/setReducer, which one of the three options is the most appropriate? I would be inclined to think the third one. My reasoning is that on the input side the mapper/reducer must handle at least a subclass of the key/value classes, and on the output side the mapper/reducer defines what the key/value classes are.

          Option 1

public static <K1, V1, K2, V2> void addMapper(boolean isMap, JobConf jobConf,
                           Class<? extends Mapper<? extends K1, ? extends V1, ? extends K2, ? extends V2>> klass,
                           Class<K1> inputKeyClass, Class<V1> inputValueClass,
                           Class<K2> outputKeyClass, Class<V2> outputValueClass,
                           boolean byValue, JobConf mapperConf) {
          

          Option 2

          public static <K1, V1, K2, V2> void addMapper(boolean isMap, JobConf jobConf,
                                     Class<? extends Mapper<K1, V1, K2, V2>> klass,
                                     Class<? extends K1> inputKeyClass, Class<? extends V1> inputValueClass,
                                     Class<? extends K2> outputKeyClass, Class<? extends V2> outputValueClass,
                                     boolean byValue, JobConf mapperConf) {
          

          Option 3

          public static <K1, V1, K2, V2> void addMapper(boolean isMap, JobConf jobConf,
                                     Class<? extends Mapper<K1, V1, K2, V2>> klass,
                                     Class<? extends K1> inputKeyClass, Class<? extends V1> inputValueClass,
                                     Class<K2> outputKeyClass, Class<V2> outputValueClass,
                                     boolean byValue, JobConf mapperConf) {
          
          Enis Soztutar added a comment -

Let's see:
if we have

  class B extends A

then Mapper<A, X, X, X> can accept Class<B> as the inputKeyClass, but not the other way around, right?
In this case options 1 and 3 are not correct.
Option 2 is good, I guess. And there is no need to limit the output key/value types (the input key class can be the same as the output key class).

          Alejandro Abdelnur added a comment -

I'm confused as to why option 3 would not work with your example.

My reasoning for option 3 is that on the input side the mapper/reducer can process a less specific class than the declared input key/value classes, but the output cannot be less (or more) specific, because the concrete output class is what is used to obtain the serializer.

Classes: A, B extends A, C extends B

Input Key/Value classes: C, B
Output Key/Value classes: A, B

Valid mapper classes: Mapper<C, B, A, B>, Mapper<C, A, A, B>, Mapper<B, B, A, B>, Mapper<B, A, A, B>, Mapper<A, B, A, B>, Mapper<A, A, A, B>

Invalid mapper classes: Mapper<*, *, B, B> (and any other mapper where the output is not A, B)

          Enis Soztutar added a comment -

OK, I mistook option 3 for a derivative of option 1, when it is clearly a derivative of option 2.
However, the output classes can indeed be more specific than declared. An example of this is a mapper which does not change the input key and directly outputs it.

The following code fragment demonstrates that we should go with option 2.

            static class A  {
            }
            static class B extends A {
            }
            static class AMapper extends MapReduceBase implements Mapper<A, Text, A, Text> {
              @Override
              public void map(A key, Text value, OutputCollector<A, Text> output,
                  Reporter reporter) throws IOException {
              }
            }
            static class BMapper extends MapReduceBase implements Mapper<B, Text, B, Text> {
              @Override
              public void map(B key, Text value, OutputCollector<B, Text> output,
                  Reporter reporter) throws IOException {
              }
            }
            static {
              JobConf job = new JobConf();
              addMapper(true, job, AMapper.class, A.class, Text.class, A.class, Text.class, false, job);
              addMapper(true, job, AMapper.class, B.class, Text.class, B.class, Text.class, false, job);
              addMapper(true, job, BMapper.class, A.class, Text.class, A.class, Text.class, false, job);//should fail
              addMapper(true, job, BMapper.class, B.class, Text.class, B.class, Text.class, false, job);
            }
          
          Alejandro Abdelnur added a comment -

With option 3, the //should fail line fails to compile.

Your comment "However the output classes can indeed be more specific than declared. An example of this is a mapper which does not change the input key and directly outputs it." is correct; nasty, but correct.

This then brings up another question: currently, when adding a mapper to the chain, the addMapper method checks that the input key/value classes match (are equal, a ==) the output key/value classes of the previous mapper. Should this be changed to an isAssignable() check? Or is that too much?
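
For illustration, a sketch of what the relaxed check could look like (validateKeyClasses is a hypothetical helper, not the patch's actual code):

  // The previous mapper's output key class must be assignable to the
  // next mapper's input key class (previously checked with ==).
  static void validateKeyClasses(Class<?> prevOutputKeyClass,
                                 Class<?> nextInputKeyClass) {
    if (!nextInputKeyClass.isAssignableFrom(prevOutputKeyClass)) {
      throw new IllegalArgumentException(
          "Output key class " + prevOutputKeyClass.getName() +
          " is not assignable to input key class " +
          nextInputKeyClass.getName());
    }
  }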

          Alejandro Abdelnur made changes -
          Status Patch Available [ 10002 ] Open [ 1 ]
          Alejandro Abdelnur added a comment -

Changing the signature of the addMapper and setReducer methods to use ? extends T for the input/output key/value classes.

Changing the check of chained input/output key/value classes to use isAssignable instead of equals.

          Alejandro Abdelnur made changes -
          Attachment patch3702.txt [ 12389651 ]
          Alejandro Abdelnur made changes -
          Status Open [ 1 ] Patch Available [ 10002 ]
          Hadoop QA added a comment -

          +1 overall. Here are the results of testing the latest attachment
          http://issues.apache.org/jira/secure/attachment/12389651/patch3702.txt
          against trunk revision 692700.

          +1 @author. The patch does not contain any @author tags.

          +1 tests included. The patch appears to include 9 new or modified tests.

          +1 javadoc. The javadoc tool did not generate any warning messages.

          +1 javac. The applied patch does not increase the total number of javac compiler warnings.

          +1 findbugs. The patch does not introduce any new Findbugs warnings.

          +1 core tests. The patch passed core unit tests.

          +1 contrib tests. The patch passed contrib unit tests.

          Test results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3202/testReport/
          Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3202/artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
          Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3202/artifact/trunk/build/test/checkstyle-errors.html
          Console output: http://hudson.zones.apache.org/hudson/job/Hadoop-Patch/3202/console

          This message is automatically generated.

          Enis Soztutar added a comment -

          +1

          Devaraj Das added a comment -

          I just committed this. Thanks, Alejandro!

          Devaraj Das made changes -
          Status Patch Available [ 10002 ] Resolved [ 5 ]
          Resolution Fixed [ 1 ]
          Hadoop Flags [Reviewed]
Hudson added a comment -

Integrated in Hadoop-trunk #600 (See http://hudson.zones.apache.org/hudson/job/Hadoop-trunk/600/ )
          Robert Chansler made changes -
Release Note The ChainMapper and the ChainReducer classes allow composing chains of Maps and Reduces in a single Map/Reduce job, something like MAP+ / REDUCE MAP*. An immediate benefit of this pattern is a reduction in disk I/O, as many Maps can be clubbed together in a single job.
          Introduced ChainMapper and the ChainReducer classes to allow composing chains of Maps and Reduces in a single Map/Reduce job, something like MAP+ REDUCE MAP*.
          Nigel Daley made changes -
          Status Resolved [ 5 ] Closed [ 6 ]
          Owen O'Malley made changes -
          Component/s mapred [ 12310690 ]

People

• Assignee: Alejandro Abdelnur
• Reporter: Alejandro Abdelnur
• Votes: 0
• Watchers: 5
