  1. Hadoop Map/Reduce
  2. MAPREDUCE-2042

InputFormat#getSplits() is called twice in local mode

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Abandoned
    • Affects Version/s: 0.20.1, 0.20.2
    • Fix Version/s: 0.20.3
    • Component/s: job submission
    • Labels: None

    Description

      In local mode, InputFormat#getSplits() is called twice.

      • 1st time: JobClient#writeOld/NewSplits() (which then writes the splits to disk)
      • 2nd time: LocalJobRunner#run() (instead of reading the split file)

      This becomes a problem when the InputFormat accesses external resources or is otherwise slow to create the splits.
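      The cost of the duplicate call, and one possible way to sidestep it, can be sketched with a stand-alone model. This is not the attached patch and it uses no Hadoop classes: SplitSource, CountingSplitSource and MemoizingSplitSource are hypothetical names invented for illustration. It also assumes both calls reach the same object; the report does not say whether the InputFormat instance is reused between job submission and LocalJobRunner#run(), so a real workaround might need a cache that outlives the instance.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Stand-alone model of the problem; none of these types are Hadoop classes.
class SplitCacheDemo {

    /** Hypothetical stand-in for the split-producing part of an InputFormat. */
    interface SplitSource {
        List<String> getSplits(int numSplits);
    }

    /** Simulates an expensive source and counts how often it is asked for splits. */
    static final class CountingSplitSource implements SplitSource {
        int calls = 0;

        @Override
        public List<String> getSplits(int numSplits) {
            calls++; // imagine a slow lookup against an external resource here
            List<String> splits = new ArrayList<>();
            for (int i = 0; i < numSplits; i++) {
                splits.add("split-" + i);
            }
            return splits;
        }
    }

    /** Computes the splits on the first call and reuses them afterwards. */
    static final class MemoizingSplitSource implements SplitSource {
        private final SplitSource delegate;
        private List<String> cached;
        private int cachedFor = -1;

        MemoizingSplitSource(SplitSource delegate) {
            this.delegate = delegate;
        }

        @Override
        public synchronized List<String> getSplits(int numSplits) {
            // Recompute only on the first call or when the requested count changes.
            if (cached == null || cachedFor != numSplits) {
                cached = Collections.unmodifiableList(delegate.getSplits(numSplits));
                cachedFor = numSplits;
            }
            return cached;
        }
    }

    public static void main(String[] args) {
        CountingSplitSource expensive = new CountingSplitSource();
        SplitSource source = new MemoizingSplitSource(expensive);

        source.getSplits(3); // stands in for the call made at job submission
        source.getSplits(3); // stands in for the call made by the local runner

        System.out.println("underlying calls: " + expensive.calls);
    }
}
```

      With the wrapper in place the expensive computation runs once even though getSplits() is requested twice. Reading the split file that was already written, as the description suggests, would avoid the second call altogether.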

      Attachments

        1. MAPREDUCE-2042.patch
          3 kB
          Johannes Zillmann

          People

            Assignee: Unassigned
            Reporter: Johannes Zillmann (oae)
            Votes: 0
            Watchers: 3
