Uploaded image for project: 'Bigtop'
  1. Bigtop
  2. BIGTOP-1128

FIX and modularize mahout sample data sets

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.7.0
    • Fix Version/s: 0.8.0
    • Component/s: tests
    • Labels:
      None

      Description

      The mahout smokes have alot of dependencies

      Concretely, we need to fix the movie lens sample data which has moved....
      from http://www.grouplens.org/system/files/ml-1m.zip
      to http://files.grouplens.org/papers/ml-1m.zip

      Otherwise mahout smokes break for obvious reasons.

      More generally, consolidating and verifying these download URLs in a separate function might make for simpler debugging of the tests, otherwise, you get html documents stored as .zip files, which causes a very hard to interpret error in the tests, i.e. you get an exception about how the zip file isnt formatted correctly.

      Other Thoughts on how to simplify and isolate moving parts of mahout tests?
      We can bundle them into a patch. Would be a shame if the only thing this JIRA resulted in was a fix to a single URL ....

        Attachments

          Activity

            People

            • Assignee:
              jayunit100 jay vyas
              Reporter:
              jayunit100 jay vyas
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: