Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-1224

Add the option of running a StreamingKMeans pass in the Reducer before BallKMeans

    XMLWordPrintableJSON

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.8
    • Fix Version/s: 0.8
    • Component/s: Clustering
    • Labels:
      None

      Description

      Sometimes, the number of points passed to the reducer from the mappers in the StreamingKMeansDriver job is too large to fit into memory.

      In that case, applying another StreamingKMeans pass can collapse the mapper intermediate clusters to a more manageable size to be clustered.

        Attachments

          Activity

            People

            • Assignee:
              dfilimon Dan Filimon
              Reporter:
              dfilimon Dan Filimon
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: