SPARK-22119

Add cosine distance to KMeans

Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 2.2.0
    • Fix Version/s: 2.4.0
    • Component/s: ML, MLlib
    • Labels: None

    Description

      Currently, KMeans supports only the Euclidean distance measure.

      In some use cases, e.g. text mining, other distance measures such as the cosine distance are widely used. For these use cases, KMeans should support multiple distance measures.

      This ticket adds support for the cosine distance measure to KMeans. Later, other algorithms can be extended to support several distance measures, and further distance measures can be added.
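
      To illustrate why the choice of distance measure matters, here is a minimal, self-contained sketch in plain Python. It is not Spark's implementation; the helper names (cosine_distance, euclidean_distance, nearest_center) and the sample vectors are hypothetical and chosen only for illustration. It assumes cosine distance is defined as 1 minus the cosine similarity and shows a point that is assigned to different centers under the two measures.

```python
import math


def cosine_distance(a, b):
    """Cosine distance, assumed here to be 1 - cosine similarity."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # The angle to a zero vector is undefined, so fail loudly.
        raise ValueError("cosine distance is undefined for zero vectors")
    return 1.0 - dot / (norm_a * norm_b)


def euclidean_distance(a, b):
    """Standard Euclidean (L2) distance."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest_center(point, centers, distance):
    """Index of the center closest to `point` under `distance`."""
    return min(range(len(centers)), key=lambda i: distance(point, centers[i]))


# Hypothetical data: centers[1] points in almost the same direction
# as `doc`, but has a much larger magnitude.
centers = [[1.0, 0.0], [5.0, 50.0]]
doc = [0.2, 1.0]

by_euclidean = nearest_center(doc, centers, euclidean_distance)
by_cosine = nearest_center(doc, centers, cosine_distance)
print("Euclidean assigns doc to center", by_euclidean)
print("Cosine assigns doc to center", by_cosine)
```

      Euclidean distance is dominated by vector magnitude, while cosine distance depends only on direction. This is why cosine distance is often preferred for text vectors, where document length should not decide cluster membership.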

            People

              mgaido Marco Gaido
              mgaido Marco Gaido