Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-1442

Zero-byte input files are not included in InputSplit

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.12.3
    • 0.14.0
    • None
    • None
    • ALL

    Description

      The default FileInputFormat::getSplits method does not include zero-byte input files in FileSplit[]. Applications that assume the minimum number of maps to be at least number of input files can get bitten by this, as well as applications that use -reducer NONE to produce number of partitions that are equal to input partitions.

      Attachments

        1. HADOOP-1442_1.patch
          3 kB
          Senthil Subramanian

        Activity

          People

            senthil Senthil Subramanian
            milindb Milind Barve
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: