Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-3439

Add Canopy Clustering Algorithm

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Minor
    • Resolution: Won't Fix
    • None
    • None
    • MLlib

    Description

      The canopy clustering algorithm is an unsupervised pre-clustering algorithm. It is often used as a preprocessing step for the K-means algorithm or the Hierarchical clustering algorithm. It is intended to speed up clustering operations on large data sets, where using another algorithm directly may be impractical due to the size of the data set.

      Attachments

        Activity

          People

            angellandros Muhammad-Ali A'rabi
            yuu.ishikawa@gmail.com Yu Ishikawa
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: