Mahout / MAHOUT-160

ClusterDumper utility to output all the clusters in all sequence files and points

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.2
    • Component/s: None
    • Labels: None

    Description

      The current ClusterDumper utility takes one sequence file and one points file as input and prints each cluster vector in the sequence file along with the points that belong to that cluster. The utility does not produce correct results when there are multiple sequence files and points files.

      To avoid this problem, all of the point-to-cluster mappings need to be read first; only then should the utility iterate over the sequence files.
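
The two-pass order described above can be sketched as follows. This is only an illustration under stated assumptions, not the code in the attached patches: plain in-memory maps stand in for the sequence files and points files, and the class and method names (`ClusterDumpSketch`, `indexPoints`, `dump`) are hypothetical rather than Mahout APIs.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: in-memory maps stand in for the sequence files and
// points files. None of these names are part of Mahout.
final class ClusterDumpSketch {

    // Pass 1: merge the point -> cluster mappings from every points file
    // into a single cluster -> points index.
    static Map<String, List<String>> indexPoints(List<Map<String, String>> pointsFiles) {
        Map<String, List<String>> pointsByCluster = new LinkedHashMap<>();
        for (Map<String, String> file : pointsFiles) {
            for (Map.Entry<String, String> mapping : file.entrySet()) {
                String pointId = mapping.getKey();
                String clusterId = mapping.getValue();
                pointsByCluster
                        .computeIfAbsent(clusterId, k -> new ArrayList<>())
                        .add(pointId);
            }
        }
        return pointsByCluster;
    }

    // Pass 2: walk every cluster file and emit each cluster (id and vector
    // text) together with all points assigned to it, whichever points file
    // they originally came from.
    static List<String> dump(List<Map<String, String>> clusterFiles,
                             Map<String, List<String>> pointsByCluster) {
        List<String> lines = new ArrayList<>();
        for (Map<String, String> file : clusterFiles) {
            for (Map.Entry<String, String> cluster : file.entrySet()) {
                List<String> points = pointsByCluster.getOrDefault(
                        cluster.getKey(), Collections.<String>emptyList());
                lines.add(cluster.getKey() + " " + cluster.getValue()
                        + " points=" + points);
            }
        }
        return lines;
    }
}
```

Because the index is complete before any cluster is printed, a cluster stored in one file still picks up points recorded in a different points file, which is the case the single-file approach misses.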

      Attachments

        1. mahout-160-dict.patch
          12 kB
          Shashikant Kore
        2. mahout-160.patch
          8 kB
          Shashikant Kore

          People

            Assignee: Grant Ingersoll (gsingers)
            Reporter: Shashikant Kore (kshashi)
            Votes: 0
            Watchers: 0

            Dates

              Created:
              Updated:
              Resolved: