Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
Description
Streaming should produce a short per-job log in a persistent location.
The log should include (at least, but not limited to):
– the command line that hadoop-streaming is executing
– the list of matching input fragments
– the actual command lines used by the framework for -mapper and for -reducer
– log about all copied files
– the name of the output HDFS directory
– start and end time of the job
– location of stderr logs and other output.