Details
Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
0.17.0
None
None
Reviewed
Description
Hadoop 0.17 streaming allows specifying multiple user-specific jar files on the command line along with hadoop-streaming.jar. These user-specific jar files are not automatically added to the client-side classpath when hadoop runs the streaming job.
For example, in the following streaming job, <user-app.jar> is not automatically added to the client-side classpath before the job is executed:
hadoop jar -libjars <user-app.jar> $HADOOP_HOME/hadoop-streaming.jar \
-input </input-data-path> \
-output </output-data-path> \
-inputformat <Input-format> \
-outputformat <OutputFormat> \
-mapper /bin/cat -reducer /bin/cat \
-additionalconfspec <xxxx> \
-jobconf mapred.reduce.tasks=3
Workaround:
The current workaround is to add these user-specific jar files to the HADOOP_CLASSPATH environment variable on the client side before running the above example.
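A minimal sketch of that workaround, assuming a POSIX shell on the client. The jar path /path/to/user-app.jar is a hypothetical placeholder for <user-app.jar>, not a path from this report. The jar is put first on HADOOP_CLASSPATH and any entries already set are kept.

```shell
#!/bin/sh
# Hypothetical jar location (assumption); replace with the real path to <user-app.jar>.
USER_APP_JAR="/path/to/user-app.jar"

# Put the user jar first, keeping any entries already in HADOOP_CLASSPATH.
# ${VAR:+...} adds the ':' separator only when HADOOP_CLASSPATH is already non-empty.
HADOOP_CLASSPATH="${USER_APP_JAR}${HADOOP_CLASSPATH:+:${HADOOP_CLASSPATH}}"
export HADOOP_CLASSPATH

# Show the value the hadoop client will see.
echo "HADOOP_CLASSPATH=${HADOOP_CLASSPATH}"

# The streaming job from the example above would then be launched from this same shell.
```

Several user jars can be added the same way, joined with ':' in the order they should appear on the classpath.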