Details
Type: Improvement
Status: Open
Priority: Minor
Resolution: Unresolved
Description
The Hadoop streaming docs here should be improved to include an example of a case where, in the docs' own words, "complicated quoting is can introduce bugs". The line containing this text should read something like:
The "-file shellMapper.sh" part isn't entirely necessary. You can simply use a clause like "-mapper 'sed | grep | awk'" or some such but complicated quoting can introduce bugs. As an example, "-mapper 'grep "text"'" will not work. Wrapping the job in a shell script eliminates some of these issues, but quoting should be avoided where possible.
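The wrapper-script approach described above could be sketched roughly as follows. This is only an illustration: the contents of shellMapper.sh, the sample input, and the commented-out job submission (the STREAMING_JAR variable and the in/out paths) are placeholders assumed for the example, not taken from the docs.

```shell
# Put the pipeline in a script so the inner double quotes never have to
# pass through the streaming command line.
cat > shellMapper.sh <<'EOF'
#!/bin/sh
grep "text"
EOF
chmod +x shellMapper.sh

# Local check: the mapper reads stdin and writes matching lines to stdout.
printf 'some text here\nno match\n' | ./shellMapper.sh

# Hypothetical job submission; the jar location and the input/output
# paths are placeholders that depend on the installation:
# hadoop jar "$STREAMING_JAR" -input in -output out \
#     -mapper shellMapper.sh -file shellMapper.sh
```

Running the script locally on a few lines of sample input before submitting the job is a cheap way to confirm that the quoting inside the script behaves as intended.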