Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-1351

Adding DenseVector support to AbstractCluster

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 0.8
    • Fix Version/s: 0.9
    • Component/s: Clustering
    • Labels:

      Description

      This improvement reduces runtime by 80% when performing k-means clustering of Scale Invariant Feature Transform (SIFT) descriptors to derive visual words for computer vision. Unlike sparse document vectors, SIFT descriptors are dense. This improvement involves updating the org.apache.mahout.clustering.AbstractCluster(Vector point, int id2) constructor to use "point.clone()" instead of "new RandomAccessSparseVector(point)" for creating the centroid. Also added testKMeansSeqJobDenseVector() test for DenseVector processing.

        Attachments

        1. MAHOUT-1351.patch
          5 kB
          Dave DeBarr

          Activity

            People

            • Assignee:
              smarthi Suneel Marthi
              Reporter:
              debarr Dave DeBarr
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - 1h
                1h
                Remaining:
                Remaining Estimate - 1h
                1h
                Logged:
                Time Spent - Not Specified
                Not Specified