Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-99

Improving speed of KMeans

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.1
    • classic
    • None

    Description

      Improved the speed of KMeans by passing only cluster ID from mapper to reducer. Previously, whole Cluster Info as formatted s`tring was being sent.
      Also removed the implicit assumption of Combiner runs only once approach and the code is modified accordingly so that it won't create a bug when combiner runs zero or more than once.

      Attachments

        1. MAHOUT-99-1.patch
          44 kB
          Rohini Uppuluri
        2. MAHOUT-99.patch
          42 kB
          Pallavi Palleti
        3. MAHOUT-99.patch
          10 kB
          Pallavi Palleti
        4. Mahout-99.patch
          49 kB
          Pallavi Palleti

        Issue Links

          Activity

            People

              gsingers Grant Ingersoll
              pallavipalleti Pallavi Palleti
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: