Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-960

Incorrect number of map tasks when there are multiple input files

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Invalid
    • 0.10.1
    • None
    • documentation
    • None

    Description

      This problem happens with hadoop-streaming and possibly elsewhere. If there are 5 input files, it will create 130 map tasks, even if mapred.map.tasks=128. The number of map tasks is incorrectly set to a multiple of the number of files. (I wrote a much more complete bug report, but Jira lost it when it had an error, so I'm not in the mood to write it all again)

      Attachments

        Activity

          People

            Unassigned Unassigned
            amcnabb Andrew McNabb
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: