Details
-
Improvement
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
1.0.0, 1.1.0, 1.3.0
-
None
Description
Hadoop natively supports multiple outputs. The objective is to extend Giraph to support multiple output formats during a single giraph run.
According to the official Hadoop apidocs*, to take advantage of multiple outputs the the pattern is the following:
- Modify the job submission
- Modify the reducer class to write on the declared different outputs
Since Giraph jobs are executed as mappers, probably this approach (or at least its second part) is not feasible, so further investigation is necessary.