Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-1224

Add the option of running a StreamingKMeans pass in the Reducer before BallKMeans

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.8
    • 0.8
    • classic
    • None

    Description

      Sometimes, the number of points passed to the reducer from the mappers in the StreamingKMeansDriver job is too large to fit into memory.

      In that case, applying another StreamingKMeans pass can collapse the mapper intermediate clusters to a more manageable size to be clustered.

      Attachments

        Activity

          People

            dfilimon Dan Filimon
            dfilimon Dan Filimon
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: