Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-7879

KMeans API for spark.ml Pipelines

    XMLWordPrintableJSON

    Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.5.0
    • Component/s: ML
    • Labels:
      None
    • Target Version/s:

      Description

      Create a K-Means API for the spark.ml Pipelines API. This should wrap the existing KMeans implementation in spark.mllib.

      This should be the first clustering method added to Pipelines, and it will be important to consider SPARK-7610 and think about designing the clustering API. We do not have to have abstractions from the beginning (and probably should not) but should think far enough ahead so we can add abstractions later on.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                yuu.ishikawa@gmail.com Yu Ishikawa
                Reporter:
                josephkb Joseph K. Bradley
                Shepherd:
                Joseph K. Bradley
              • Votes:
                0 Vote for this issue
                Watchers:
                4 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: