Hadoop Common / HADOOP-3616

TextInputFormat taking max of two minima as the minimum

Details

    • Type: Bug
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: 0.17.0
    • Fix Version/s: None
    • Component/s: None
    • Labels: None

    Description

      When choosing its minimum split size, FileInputFormat takes the larger of the two minimum split values (the configured mapred.min.split.size and its own minSplitSize field) rather than the smaller. I can't find a good explanation for this, so it would help to add a comment there, or to change the call to Math.min if that is the intent.

      Line 237:
      long minSize = Math.max(job.getLong("mapred.min.split.size", 1), minSplitSize);
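      To make the difference concrete, here is a minimal sketch of the two candidate behaviours. It does not use the Hadoop API: the class MinSplitSizeSketch, its methods, and the sample values are hypothetical and only mirror the shape of the expression on line 237.

```java
// Hypothetical illustration only; not Hadoop code.
final class MinSplitSizeSketch {

    // Mirrors the current expression: the larger of the two minima wins,
    // so the result never drops below either bound.
    static long withMax(long configuredMin, long formatMin) {
        return Math.max(configuredMin, formatMin);
    }

    // The alternative raised in this report: the smaller of the two wins,
    // so either side can lower the effective minimum.
    static long withMin(long configuredMin, long formatMin) {
        return Math.min(configuredMin, formatMin);
    }

    public static void main(String[] args) {
        long configuredMin = 1L;   // assumed: the default used on line 237
        long formatMin = 1024L;    // assumed: a made-up value for minSplitSize

        System.out.println("Math.max -> " + withMax(configuredMin, formatMin)); // prints "Math.max -> 1024"
        System.out.println("Math.min -> " + withMin(configuredMin, formatMin)); // prints "Math.min -> 1"
    }
}
```

      With Math.max, a format that sets its own minimum cannot be undercut by a smaller configured value. With Math.min, the default configured value of 1 would always win. Which of these is intended is the open question here.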

          People

            Assignee: Unassigned
            Reporter: Josh Myer (jbm)
            Votes: 0
            Watchers: 1