Mahout / MAHOUT-1497

mahout resplit not producing split files

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.8
    • Fix Version/s: 0.10.0
    • Component/s: classic
    • Labels: None

    Description

      When I run "mahout resplit", I get the output below, but no split files are produced.

      support@hadoop1:~$ mahout resplit --input .../final/clusteredPoints/part-m-* --output .../final/split --numSplits 4
      MAHOUT_LOCAL is not set; adding HADOOP_CONF_DIR to classpath.
      Running on hadoop, using /opt/cloudera/parcels/CDH-5.0.0-0.cdh5b2.p0.27/bin/../lib/hadoop/bin/hadoop and HADOOP_CONF_DIR=/etc/hadoop/conf
      MAHOUT-JOB: /opt/cloudera/parcels/CDH-5.0.0-0.cdh5b2.p0.27/lib/mahout/mahout-examples-0.8-cdh5.0.0-beta-2-job.jar
      14/03/28 16:22:50 WARN driver.MahoutDriver: No resplit.props found on classpath, will use command-line arguments only
      Writing 4 splits
      Writing split 0
      Writing split 1
      Writing split 2
      Writing split 3
      14/03/28 16:22:52 INFO driver.MahoutDriver: Program took 2077 ms (Minutes: 0.034616666666666664)
      

      The folder "clusteredPoints" passed to --input of resplit contains the clustered points generated by Mahout's k-means algorithm.

          People

            Assignee: Sebastian Schelter (ssc)
            Reporter: Reinis Vicups (reinis_v)
            Votes: 0
            Watchers: 3
