Details
Type: Improvement
Status: Open
Priority: Major
Resolution: Unresolved
Affects Version/s: 0.20.1, 0.20.2, 0.20.3, 0.21.0, 0.21.1, 0.22.0
Fix Version/s: None
Component/s: None
Environment: Ubuntu 10.04 Lucid 64bit, Java(TM) SE Runtime Environment (build 1.6.0_20-b02), Hadoop 20.2 (Cloudera distribution)
Labels: hadoop mapreduce pipes map output class
Description
In org.apache.hadoop.mapred.pipes.PipesMapRunner.java, lines 71-72 currently call:
- job.getOutputKeyClass() and
- job.getOutputValueClass()
They should instead call:
- job.getMapOutputKeyClass() and
- job.getMapOutputValueClass()
This change would allow Pipes jobs to use intermediate map output classes that differ from the job's final output classes.
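
To illustrate why the choice of getter matters, here is a minimal, self-contained sketch. It does not use Hadoop itself: MiniJobConf is a hypothetical stand-in that only mimics the way JobConf falls back from the map output key class to the final output key class when no map output class has been set. The property names and the Object default are assumptions made for illustration, not Hadoop's actual values.

```java
import java.util.HashMap;
import java.util.Map;

public class MapOutputClassSketch {

    // Hypothetical stand-in for JobConf; not Hadoop code.
    static class MiniJobConf {
        private final Map<String, Class<?>> classes = new HashMap<>();

        void setOutputKeyClass(Class<?> c) {
            classes.put("output.key", c);       // illustrative property name
        }

        void setMapOutputKeyClass(Class<?> c) {
            classes.put("map.output.key", c);   // illustrative property name
        }

        // Final (reduce-side) output key class; Object is this sketch's default.
        Class<?> getOutputKeyClass() {
            return classes.getOrDefault("output.key", Object.class);
        }

        // Intermediate (map-side) key class; falls back to the final
        // output key class when no map output class was configured.
        Class<?> getMapOutputKeyClass() {
            Class<?> c = classes.get("map.output.key");
            return c != null ? c : getOutputKeyClass();
        }
    }

    public static void main(String[] args) {
        MiniJobConf job = new MiniJobConf();
        job.setOutputKeyClass(String.class);    // final output key type
        job.setMapOutputKeyClass(Long.class);   // intermediate key type

        // A map runner that asks for the final output class sees String,
        // even though the mapper actually emits Long keys.
        System.out.println("getOutputKeyClass:    " + job.getOutputKeyClass().getSimpleName());
        // Asking for the map output class returns the intermediate type.
        System.out.println("getMapOutputKeyClass: " + job.getMapOutputKeyClass().getSimpleName());
    }
}
```

Because of the fallback, jobs that never set a separate map output class would see the same result from either getter, so the proposed change should only affect jobs whose intermediate and final types differ.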