Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
See HIVE-4002.
We can replace OutputCollector in ReduceSink to output to say a sequence file. Then instead of fetching file output written by file sinks in map tasks, the client-side reducer can fetch reducer output via some simple operator and work the same way as normal reducer.
It can also take advantage of additional ReduceSink functionality.