Mahout
  1. Mahout
  2. MAHOUT-419

Convert decomposer code to Hadoop 0.20 API

    Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Major Major
    • Resolution: Duplicate
    • Affects Version/s: 0.3, 0.4
    • Fix Version/s: None
    • Component/s: Math

      Description

      org.apache.mahout.math.hadoop classes (MatrixMultiplicationJob, TimesSquaredJob, TransposeJob) all use the deprecated Hadoop API. In the spirit of MAHOUT-167 and MAHOUT-143, I suggest converting them to Hadoop's 0.20 API.

      The reason I'm raising this now is that this code no longer runs on my Hadoop 0.22-SNAPSHOT cluster (not sure why really - it was running fine about a month ago, but after updating to the latest Mahout trunk a few days ago the code throws "java.lang.RuntimeException: Error in configuring object" at MapTask.runOldMapper).

      Also, the documentation at https://cwiki.apache.org/MAHOUT/dimensionalreduction.html is no longer accurate - the command line parameters have changed (even without the new arguments from MAHOUT-308). This is partly due to using the new argument parser which receives the input/output directories differently.

        Activity

        Hide
        Sean Owen added a comment -

        I agree, and purely for housekeeping purposes I want to roll this into mega-issue MAHOUT-167.

        Show
        Sean Owen added a comment - I agree, and purely for housekeeping purposes I want to roll this into mega-issue MAHOUT-167 .

          People

          • Assignee:
            Jake Mannix
            Reporter:
            Danny Leshem
          • Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development