Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-1933

Add distance measure interface and basic implementation to machine learning library

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • None

    Description

      Add distance measure interface to calculate distance between two vectors and some implementations of the interface. In FLINK-1745, till.rohrmann suggests a interface following:

      trait DistanceMeasure {
        def distance(a: Vector, b: Vector): Double
      }
      

      I think that following list of implementation is sufficient to provide first to ML library users.

      • Manhattan distance [1]
      • Cosine distance [2]
      • Euclidean distance (and Squared) [3]
      • Tanimoto distance [4]
      • Minkowski distance [5]
      • Chebyshev distance [6]

      [1]: http://en.wikipedia.org/wiki/Taxicab_geometry
      [2]: http://en.wikipedia.org/wiki/Cosine_similarity
      [3]: http://en.wikipedia.org/wiki/Euclidean_distance
      [4]: http://en.wikipedia.org/wiki/Jaccard_index#Tanimoto_coefficient_.28extended_Jaccard_coefficient.29
      [5]: http://en.wikipedia.org/wiki/Minkowski_distance
      [6]: http://en.wikipedia.org/wiki/Chebyshev_distance

      Attachments

        Issue Links

          Activity

            People

              chiwanpark Chiwan Park
              chiwanpark Chiwan Park
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: