Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-1871

Kmeans - java.lang.IllegalStateException: No input clusters found..... Check your -c argument

    XMLWordPrintableJSON

    Details

    • Type: Question
    • Status: Resolved
    • Priority: Critical
    • Resolution: Not A Bug
    • Affects Version/s: 0.12.1
    • Fix Version/s: 0.13.0
    • Component/s: Clustering
    • Labels:
      None
    • Environment:

      S.O. Centos 6.5
      hadoop 2.7.2

      Description

      By using the kmeans with the following parameters gives the following error.

      16/06/12 17:35:43 INFO KMeansDriver: convergence: 0.5 max Iterations: 10
      16/06/12 17:35:43 INFO CodecPool: Got brand-new decompressor [.deflate]
      Exception in thread "main" java.lang.IllegalStateException: No input clusters found in /mdb/clustered_data/part-randomSeed. Check your -c argument.
      at org.apache.mahout.clustering.kmeans.KMeansDriver.buildClusters(KMeansDriver.java:213)
      at org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:147)
      at org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:110)
      at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
      at org.apache.mahout.clustering.kmeans.KMeansDriver.main(KMeansDriver.java:47)
      at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
      at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
      at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
      at java.lang.reflect.Method.invoke(Method.java:606)
      at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:71)
      at org.apache.hadoop.util.ProgramDriver.run(ProgramDriver.java:144)
      at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:152)
      at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
      at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
      at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
      at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
      at java.lang.reflect.Method.invoke(Method.java:606)
      at org.apache.hadoop.util.RunJar.run(RunJar.java:221)
      at org.apache.hadoop.util.RunJar.main(RunJar.java:136)
      [SSH] exit-status: 1
      Finished: FAILURE

      Command Execution:
      hdfs dfs -rm -R /mdb/mahout_vectors/
      hdfs dfs -rm -R /mdb/mahout_seq/
      hdfs dfs -rm -R /mdb/mahout_data/
      hdfs dfs -rm -R /mdb/clustered_data/
      echo ##### SE ELIMINAN LAS CARPETAS DE HDFS#####
      hdfs dfs -mkdir /mdb/mahout_vectors/
      hdfs dfs -mkdir /mdb/mahout_seq/
      hdfs dfs -mkdir /mdb/mahout_data/
      hdfs dfs -mkdir /mdb/clustered_data/
      echo ##### subimos el fichero #####
      hdfs dfs -put $fichero /mdb/mahout_data/
      echo ##### generamos ficheros secuenciales#####
      mahout seqdirectory -i /mdb/mahout_data/ -o /mdb/mahout_seq -c UTF-8 -chunk 64 -xm sequential
      echo ##### generamos los vectores #####
      mahout seq2sparse -i /mdb/mahout_seq/ -o /mdb/mahout_vectors/ --namedVector
      echo ##### ejecutamos el kmeans #####
      mahout kmeans -i /mdb/mahout_vectors/tfidf-vectors/ -c /mdb/clustered_data -o /mdb/mahout_data -dm org.apache.mahout.common.distance.EuclideanDistanceMeasure -x 10 -k 20 -ow --clustering

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              jcsipan Juan Carlos Sipan Robles
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: