Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-3879

Native implementation of HITS algorithm

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 1.1.0
    • 1.1.0
    • None

    Description

      Hyperlink-Induced Topic Search (HITS, also "hubs and authorities") is presented in [0] and described in [1].

      "[HITS] is a very popular and effective algorithm to rank documents based on the link information among a set of documents. The algorithm presumes that a good hub is a document that points to many others, and a good authority is a document that many documents point to." https://pdfs.semanticscholar.org/a8d7/c7a4c53a9102c4239356f9072ec62ca5e62f.pdf

      This implementation differs from FLINK-2044 by providing for convergence, outputting both hub and authority scores, and completing in half the number of iterations.

      [0] http://www.cs.cornell.edu/home/kleinber/auth.pdf
      [1] https://en.wikipedia.org/wiki/HITS_algorithm

      Attachments

        Issue Links

          Activity

            People

              greghogan Greg Hogan
              greghogan Greg Hogan
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: