Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-12879

Query Parser for MinHash/LSH

    XMLWordPrintableJSON

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 8.0
    • Fix Version/s: 8.0
    • Component/s: query parsers
    • Labels:
      None

      Description

      Following on from https://issues.apache.org/jira/browse/LUCENE-6968, provide a query parser that builds queries that provide a measure of Jaccard similarity. The initial patch includes banded queries that were also proposed on the original issue.

       

      I have one outstanding questions:

      • Should the score from the overall query be normalised?

      Note, that the band count is currently approximate and may be one less than in practise.

        Attachments

        1. minhash.filter.adoc.fragment
          3 kB
          Andy Hind
        2. minhash.patch
          75 kB
          Andy Hind
        3. minhash.qparser.adoc.fragment
          9 kB
          Andy Hind

          Issue Links

            Activity

              People

              • Assignee:
                teofili Tommaso Teofili
                Reporter:
                andyhind Andy Hind
              • Votes:
                0 Vote for this issue
                Watchers:
                5 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: