Details
-
New Feature
-
Status: Closed
-
Minor
-
Resolution: Not A Problem
-
0.9.0
-
None
-
None
Description
One of the outputs from our system needs to be a single file to support a system that is ingesting the data downstream. We currently run the job and then cat the output files together to create the final output, but it would be nice if we could pass a flag to the write(...) function to handle this case.
Note that setting the number of reducers globally for the entire job doesn't work in this case because of the significant performance implications.