Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-1090

In SortValidator, the check for whether a file belongs to sort-input or sort-output dir is weak

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.12.2
    • 0.13.0
    • None
    • None

    Description

      In SortValidator, Maps invoke the method called deduceInputFile in the configure method. The deduceInputFile is supposed to return whether the input file belongs to the sort-input directory or the sort-output directory. However, the check that deduceInputFile does - inputFile.toString().startsWith(inputPaths[0].toString()) - is not totally correct. The check will always returns true for inputPaths like /user/foo/smallInput/<filenames>, /user/foo/smallInput-sorted/<filenames>. This finally causes the SortValidator to declare the sort output as incorrect.

      Attachments

        1. HADOOP-1090_20070331_1.patch
          0.5 kB
          Arun Murthy

        Activity

          People

            acmurthy Arun Murthy
            ddas Devaraj Das
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: