Details
- Type: New Feature
- Status: Closed
- Priority: Major
- Resolution: Fixed
- None
- None
- None
- all
- Reviewed
Description
On the same input, we usually need to run multiple Maps one after the other with no Reduce in between. We also have to run multiple Maps after the Reduce.
If all pre-Reduce Maps are chained together and run as a single Map, a significant amount of disk I/O will be avoided.
Similarly, all post-Reduce Maps can be chained together and run in the Reduce phase after the Reduce.
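To illustrate the idea, here is a minimal plain-Java sketch of chaining. It does not use any Hadoop API; `ChainSketch`, `MapStep`, `runChain`, and `exampleSteps` are hypothetical names made up for this example. Each record emitted by one Map step is handed directly to the next step in memory, so no intermediate output has to be written to disk between steps. The same pattern would apply to Maps chained after a Reduce.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.function.Function;

public class ChainSketch {
    // Hypothetical stand-in for a Map step: one record in, zero or more records out.
    interface MapStep extends Function<String, List<String>> {}

    // Pushes a single record through steps[index..], handing each emitted
    // record straight to the next step instead of storing an intermediate data set.
    private static void emit(String record, List<MapStep> steps, int index, List<String> out) {
        if (index == steps.size()) {
            out.add(record); // end of the chain: this is the final Map output
            return;
        }
        for (String next : steps.get(index).apply(record)) {
            emit(next, steps, index + 1, out);
        }
    }

    // Runs all chained steps over the input as if they were a single Map.
    static List<String> runChain(List<String> input, List<MapStep> steps) {
        List<String> out = new ArrayList<>();
        for (String record : input) {
            emit(record, steps, 0, out);
        }
        return out;
    }

    // Three illustrative steps: split lines into words, lower-case, drop one-letter words.
    static List<MapStep> exampleSteps() {
        MapStep split = line -> Arrays.asList(line.split("\\s+"));
        MapStep lower = word -> Collections.singletonList(word.toLowerCase());
        MapStep filter = word -> word.length() > 1
                ? Collections.singletonList(word)
                : Collections.<String>emptyList();
        return Arrays.asList(split, lower, filter);
    }

    public static void main(String[] args) {
        List<String> input = Arrays.asList("Hello World", "a Map");
        System.out.println(runChain(input, exampleSteps())); // prints [hello, world, map]
    }
}
```

Run as three separate Map-only jobs, the same steps would each write their full output to disk and read it back in the next job. That is the disk I/O this issue proposes to avoid.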
Attachments
Issue Links
- incorporates
-
HADOOP-3927 Be able to create a Configuration out an stream (with config XML content)
- Closed