Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
Description
There are some applications where the map wants to partition the data itself. This happens in Pipes, if the user has a C++ partitioner. It would make sense to support it in streaming too. There is also use case where the Java partitioner needs the context object to update counters, etc.
This jira is only about adding the method to the mapreduce Java API. The Pipes interface can be updated in a follow up Jira.