Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-3780

Jaccard Similarity

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 1.1.0
    • 1.1.0
    • None

    Description

      Implement a Jaccard Similarity algorithm computing all non-zero similarity scores. This algorithm is similar to TriangleListing but instead of joining two-paths against an edge list we count two-paths.

      flink-gelly-examples currently has JaccardSimilarityMeasure which relies on Graph.getTriplets() so only computes similarity scores for neighbors but not neighbors-of-neighbors.

      This algorithm is easily modified for other similarity scores such as Adamic-Adar similarity where the sum of endpoint degrees is replaced by the degree of the middle vertex.

      Attachments

        Issue Links

          Activity

            People

              greghogan Greg Hogan
              greghogan Greg Hogan
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: