Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-1497

mahout resplit not producing splited files

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.8
    • Fix Version/s: 0.10.0
    • Component/s: CLI
    • Labels:
      None

      Description

      when I run "mahout resplit", I get the output below but no split files are being produced.

      support@hadoop1:~$ mahout resplit --input .../final/clusteredPoints/part-m-* --output .../final/split --numSplits 4
      MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
      Running on hadoop, using /opt/cloudera/parcels/CDH-5.0.0-0.cdh5b2.p0.27/bin/../lib/hadoop/bin/hadoop and HADOOP_CONF_DIR=/etc/hadoop/conf
      MAHOUT-JOB: /opt/cloudera/parcels/CDH-5.0.0-0.cdh5b2.p0.27/lib/mahout/mahout-examples-0.8-cdh5.0.0-beta-2-job.jar
      14/03/28 16:22:50 WARN driver.MahoutDriver: No resplit.props found on classpath, will use command-line arguments only
      Writing 4 splits
      Writing split 0
      Writing split 1
      Writing split 2
      Writing split 3
      14/03/28 16:22:52 INFO driver.MahoutDriver: Program took 2077 ms (Minutes: 0.034616666666666664)
      

      The folder "cluteredPoints" passed to --input of resplit contains clustered points generated by k-means algorithm from mahout.

        Attachments

          Activity

            People

            • Assignee:
              ssc Sebastian Schelter
              Reporter:
              reinis_v Reinis Vicups
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: