  1. Solr
  2. SOLR-12879

Query Parser for MinHash/LSH


Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 8.0
    • Fix Version/s: 8.0
    • Component/s: query parsers
    • Labels: None

    Description

      Following on from https://issues.apache.org/jira/browse/LUCENE-6968, provide a query parser that builds queries giving a measure of Jaccard similarity. The initial patch also includes the banded queries that were proposed on the original issue.
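
For readers unfamiliar with the idea, here is a minimal sketch of how MinHash signatures can approximate Jaccard similarity. It is the textbook variant written in Python for illustration only, not the scheme used by the Lucene MinHash filter or by the attached patch. The function names, the word-shingle size, and the choice of `blake2b` as a seeded hash are all assumptions.

```python
import hashlib


def shingles(text, size=3):
    """Return the set of word shingles (runs of `size` words) in `text`."""
    words = text.lower().split()
    if len(words) < size:
        return {" ".join(words)} if words else set()
    return {" ".join(words[i:i + size]) for i in range(len(words) - size + 1)}


def seeded_hash(token, seed):
    """Deterministic 64-bit hash of `token`; `seed` selects the hash function."""
    digest = hashlib.blake2b(
        token.encode("utf-8"),
        digest_size=8,
        salt=seed.to_bytes(8, "little"),
    ).digest()
    return int.from_bytes(digest, "little")


def minhash_signature(tokens, num_hashes=128):
    """Keep the minimum hash value per hash function (empty input -> [])."""
    if not tokens:
        return []
    return [min(seeded_hash(t, seed) for t in tokens) for seed in range(num_hashes)]


def estimated_jaccard(sig_a, sig_b):
    """Fraction of signature positions on which both signatures agree."""
    if not sig_a or len(sig_a) != len(sig_b):
        return 0.0
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


def exact_jaccard(set_a, set_b):
    """|A intersect B| / |A union B|, with 0.0 for two empty sets."""
    union = set_a | set_b
    return len(set_a & set_b) / len(union) if union else 0.0
```

Two documents can then be compared with `estimated_jaccard(minhash_signature(shingles(a)), minhash_signature(shingles(b)))`. The estimate tightens as `num_hashes` grows, at the cost of more terms per document.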

       

      I have one outstanding question:

      • Should the score from the overall query be normalised?

      Note that the band count is currently approximate and may be one less than the number of bands used in practice.
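
As a purely illustrative sketch of banding, and of one way a band count can come out one short, the Python below splits a signature into bands of consecutive hash values. It is an assumption made for illustration, not a description of how the patch computes its band count. The names `band_keys`, `is_candidate`, and `rows_per_band` are hypothetical.

```python
def band_keys(signature, rows_per_band):
    """Split a signature into bands of up to `rows_per_band` consecutive values."""
    if rows_per_band < 1:
        raise ValueError("rows_per_band must be at least 1")
    return [
        tuple(signature[i:i + rows_per_band])
        for i in range(0, len(signature), rows_per_band)
    ]


def is_candidate(sig_a, sig_b, rows_per_band):
    """Two signatures are candidates if any aligned band matches exactly."""
    bands_a = band_keys(sig_a, rows_per_band)
    bands_b = band_keys(sig_b, rows_per_band)
    return any(x == y for x, y in zip(bands_a, bands_b))


def candidate_probability(similarity, bands, rows_per_band):
    """Standard LSH estimate: 1 - (1 - s^r)^b for Jaccard similarity s."""
    return 1.0 - (1.0 - similarity ** rows_per_band) ** bands


signature = list(range(10))          # stand-in for 10 min-hash values
actual = len(band_keys(signature, 4))  # includes the trailing partial band
estimated = len(signature) // 4        # floor division ignores the partial band
```

Here the signature length is not a multiple of the band size, so the floor-division estimate counts one band fewer than the split actually produces.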

      Attachments

        1. minhash.patch
          75 kB
          Andy Hind
        2. minhash.filter.adoc.fragment
          3 kB
          Andy Hind
        3. minhash.qparser.adoc.fragment
          9 kB
          Andy Hind

            People

              Assignee: Tommaso Teofili (teofili)
              Reporter: Andy Hind (andyhind)
              Votes: 0
              Watchers: 6

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 10m