Details
-
Improvement
-
Status: Closed
-
Critical
-
Resolution: Fixed
-
None
-
None
-
None
-
Reviewed
Description
In the case where the entire reduce inputs fit in ram, we currently force the input to disk and re-read it before giving it to the reducer. It would be much better if we merged from the ramfs and any spills to feed the reducer its input.
Attachments
Attachments
Issue Links
- is depended upon by
-
HADOOP-4063 Separate spill thresholds for serialization/accounting in MapTask
- Closed