  1. Mahout
  2. MAHOUT-763

Map-Side Distance Comparison


Details

    • Type: New Feature
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.6
    • Component/s: None
    • Labels: None

    Description

      KMeans currently calculates, on the map side, the distance between a set of seeds and all other vectors. It would be handy to have a generalization of this that, given a set of vectors that fits in memory (the seeds) and a set of other points, emits <seed id, other id, distance> for every pair according to the configured distance measure. This is similar to RowSimilarityJob, but much simpler and less general-purpose.
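
A minimal sketch of the idea in plain Java, with no Hadoop or Mahout dependencies. It is not the code from the attached patches. The names SeedDistanceSketch, Distance, Triple and mapPoint are hypothetical, and Euclidean distance is only an example measure. It assumes the seeds have already been loaded into memory (for instance once per mapper) and shows what a single map call might emit for one input point.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

final class SeedDistanceSketch {

    /** Hypothetical stand-in for a pluggable distance measure. */
    interface Distance {
        double between(double[] a, double[] b);
    }

    /** Plain Euclidean distance, used here only as an example measure. */
    static final Distance EUCLIDEAN = (a, b) -> {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return Math.sqrt(sum);
    };

    /** One emitted record: <seed id, other id, distance>. */
    static final class Triple {
        final String seedId;
        final String otherId;
        final double distance;

        Triple(String seedId, String otherId, double distance) {
            this.seedId = seedId;
            this.otherId = otherId;
            this.distance = distance;
        }
    }

    /**
     * What one map call would do for one input point: compare it against
     * every in-memory seed and emit one triple per seed. No reduce step is needed.
     */
    static List<Triple> mapPoint(Map<String, double[]> seeds,
                                 String otherId,
                                 double[] other,
                                 Distance measure) {
        List<Triple> emitted = new ArrayList<>();
        for (Map.Entry<String, double[]> seed : seeds.entrySet()) {
            double d = measure.between(seed.getValue(), other);
            emitted.add(new Triple(seed.getKey(), otherId, d));
        }
        return emitted;
    }

    public static void main(String[] args) {
        // Assumption: the seed set is small enough to hold in memory.
        Map<String, double[]> seeds = new LinkedHashMap<>();
        seeds.put("seed-0", new double[] {0.0, 0.0});
        seeds.put("seed-1", new double[] {3.0, 4.0});

        for (Triple t : mapPoint(seeds, "point-0", new double[] {3.0, 0.0}, EUCLIDEAN)) {
            System.out.printf("%s\t%s\t%.3f%n", t.seedId, t.otherId, t.distance);
        }
    }
}
```

Because every seed is compared against every input point independently, the work is proportional to (number of seeds) × (number of points) and can stay entirely on the map side.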

      Attachments

        1. MAHOUT-763.patch
          13 kB
          Grant Ingersoll
        2. MAHOUT-763.patch
          13 kB
          Grant Ingersoll
        3. MAHOUT-763.patch
          16 kB
          Grant Ingersoll
        4. MAHOUT-763.patch
          17 kB
          Grant Ingersoll
        5. SeedVectorUtil.patch
          7 kB
          Sean R. Owen


          People

            Assignee: Grant Ingersoll (gsingers)
            Reporter: Grant Ingersoll (gsingers)
            Votes: 0
            Watchers: 0

            Dates

              Created:
              Updated:
              Resolved: