Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-614

org.apache.mahout.classifier.baytes.MultipleOutputFormat not working as intended with Hadoop 0.20?

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.4
    • 0.5
    • None

    Description

      I believe there might be an error in org.apache.mahout.classifier.baytes.MultipleOutputFormat. It overrides the Hadoop class FileOutputFormat, and most of its work is done in getRecordWriter(FileSystem, Configuration, String, Progressable). However this is not the method that one must override to control how FileOutputFormat writes records; that's getRecordWriter(TaskAttemptContext). My hunch is that this used to work, but against the Hadoop 0.19.x APIs. (@Override is our friend!)

      I've attached a patch that I believe addresses this and along the way is able to clean things up slightly. Am I on track here?

      Attachments

        1. MAHOUT-614.patch
          7 kB
          Sean R. Owen

        Activity

          People

            robinanil Robin Anil
            srowen Sean R. Owen
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: