Details
- Bug
- Status: Closed
- Blocker
- Resolution: Fixed
- Reviewed
Description
In Hadoop 0.16 and earlier, the combiner was guaranteed to run exactly once for each map. In 0.17 this guarantee was slightly broken: the combiner does not run if a single <K,V> pair occupies the entire sort buffer. In 0.18 the behavior changes further: the combiner can be called multiple times, on both the map and reduce sides.
This breaks Pig's current implementation of the combiner and it is not easy to fix in a short period of time.
We would like to ask for a way for an application to request the backward-compatible behavior for some period of time, until it can adjust to the new behavior.
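To illustrate why the number of combiner invocations matters, here is a minimal sketch in plain Python. It is a simulation only and does not use any Hadoop or Pig API. All names (`combine_mean`, `combine_sum_count`, `run_naive`, `run_safe`) and the sample data are hypothetical, and the example is not meant to describe how Pig's combiner is actually written. It compares a combiner that implicitly assumes it sees raw map output exactly once with one whose output can be fed back into itself any number of times, including zero.

```python
def combine_mean(values):
    """Fragile combiner: emits a mean, so re-combining averages the averages."""
    return sum(values) / len(values)


def combine_sum_count(pairs):
    """Robust combiner: folds (sum, count) pairs into one (sum, count) pair.

    Input and output have the same shape, so it is safe to apply
    zero, one, or many times.
    """
    total = sum(s for s, _ in pairs)
    count = sum(c for _, c in pairs)
    return (total, count)


def run_naive(spills, combiner_passes):
    """Simulate one key whose raw values are spread over several spills."""
    partials = [list(spill) for spill in spills]
    for _ in range(combiner_passes):
        # Each pass combines every spill independently.
        partials = [[combine_mean(p)] for p in partials]
    # The "reduce" step sees whatever the combiner passes left behind.
    flat = [v for p in partials for v in p]
    return combine_mean(flat)


def run_safe(spills, combiner_passes):
    """Same simulation, but map output is (value, 1) pairs."""
    partials = [[(v, 1) for v in spill] for spill in spills]
    for _ in range(combiner_passes):
        partials = [[combine_sum_count(p)] for p in partials]
    flat = [pair for p in partials for pair in p]
    total, count = combine_sum_count(flat)
    return total / count


# Hypothetical data: five values for one key, split unevenly across two spills.
spills = [[2], [4, 6, 8, 10]]

print(run_naive(spills, 0))  # combiner skipped: correct mean
print(run_naive(spills, 1))  # combiner applied per spill: mean of means
print(run_safe(spills, 0), run_safe(spills, 1), run_safe(spills, 3))
```

In this sketch the naive version returns 6.0 only when the combiner is skipped and 4.5 once it runs per spill, while the (sum, count) version returns 6.0 regardless of how many passes run. Code written against the exactly-once guarantee can hide this kind of dependency, which is why a change in invocation count can break an application.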
Attachments
Issue Links
- incorporates: HADOOP-3594 Guaranteeing that combiner is called at least once (Closed)
- relates to: HADOOP-3595 Remove deprecated mapred.combine.once functionality (Closed)