Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Invalid
-
0.10.1
-
None
-
None
Description
This problem happens with hadoop-streaming and possibly elsewhere. If there are 5 input files, it will create 130 map tasks, even if mapred.map.tasks=128. The number of map tasks is incorrectly set to a multiple of the number of files. (I wrote a much more complete bug report, but Jira lost it when it had an error, so I'm not in the mood to write it all again)