Solr / SOLR-12879

Query Parser for MinHash/LSH

Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 8.0
    • Fix Version/s: 8.0
    • Component/s: query parsers
    • Labels: None

    Description

      Following on from https://issues.apache.org/jira/browse/LUCENE-6968, provide a query parser that builds queries whose scores give a measure of Jaccard similarity. The initial patch includes the banded queries that were also proposed on the original issue.
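
      For readers new to the idea, the sketch below shows how matching MinHash values can approximate Jaccard similarity. It is a minimal illustration and not the code in the patch: the shingling, the seeded blake2b hashing, and every function name here are assumptions made for the example, and Lucene's MinHash filter uses its own hashing scheme.

```python
import hashlib


def shingles(text, k=3):
    """Return the set of k-word shingles of a text (illustrative tokenisation)."""
    words = text.lower().split()
    return {" ".join(words[i:i + k]) for i in range(max(len(words) - k + 1, 1))}


def jaccard(a, b):
    """Exact Jaccard similarity |A ∩ B| / |A ∪ B| of two sets."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def _seeded_hash(seed, item):
    # Hypothetical hash family: one 64-bit hash function per seed.
    data = f"{seed}:{item}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(items, num_hashes=128):
    """For each hash function, keep the minimum hash value over the set."""
    return [min(_seeded_hash(seed, item) for item in items)
            for seed in range(num_hashes)]


def estimate_jaccard(sig_a, sig_b):
    """Fraction of signature positions that agree; this estimates Jaccard."""
    matches = sum(1 for x, y in zip(sig_a, sig_b) if x == y)
    return matches / len(sig_a)


doc_a = shingles("the quick brown fox jumps over the lazy dog near the river bank")
doc_b = shingles("the quick brown fox jumps over the lazy cat near the river bank")
print("exact:    ", jaccard(doc_a, doc_b))
print("estimated:", estimate_jaccard(minhash_signature(doc_a), minhash_signature(doc_b)))
```

      With more hash functions the estimate tightens around the exact value, at the cost of more terms per document and per query.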

       

      I have one outstanding question:

      • Should the score from the overall query be normalised?

      Note that the band count is currently approximate and may be one less than the number of bands used in practice.
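
      As a rough illustration of banding, the sketch below splits a MinHash signature into bands of a fixed number of rows and treats two signatures as candidates when any band matches exactly. The function names and the `keep_partial` flag are hypothetical and not taken from the patch. The example also shows that the band count differs by one depending on whether a trailing partial band is kept; the issue does not say whether this is the cause of the approximation noted above, so treat that link as an assumption.

```python
import math


def split_into_bands(signature, rows_per_band, keep_partial=True):
    """Split a signature into consecutive bands of rows_per_band values.

    keep_partial (hypothetical flag) controls whether a shorter trailing
    band is kept or dropped.
    """
    bands = [tuple(signature[i:i + rows_per_band])
             for i in range(0, len(signature), rows_per_band)]
    if not keep_partial and bands and len(bands[-1]) < rows_per_band:
        bands.pop()
    return bands


def is_candidate(sig_a, sig_b, rows_per_band):
    """Two signatures are candidates if at least one band is identical."""
    bands_a = split_into_bands(sig_a, rows_per_band)
    bands_b = split_into_bands(sig_b, rows_per_band)
    return any(x == y for x, y in zip(bands_a, bands_b))


signature = list(range(10))  # stand-in for 10 min-hash values
with_partial = split_into_bands(signature, 4, keep_partial=True)
without_partial = split_into_bands(signature, 4, keep_partial=False)
print(len(with_partial), math.ceil(10 / 4))  # band count when the partial band is kept
print(len(without_partial), 10 // 4)         # band count when it is dropped
```

      More rows per band make each band harder to match, which trades recall for fewer false candidates.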

      Attachments

        1. minhash.filter.adoc.fragment (3 kB, Andy Hind)
        2. minhash.patch (75 kB, Andy Hind)
        3. minhash.qparser.adoc.fragment (9 kB, Andy Hind)

            People

              Assignee: Tommaso Teofili (teofili)
              Reporter: Andy Hind (andyhind)
              Votes: 0
              Watchers: 6

                Time Tracking

                  Original Estimate: Not Specified
                  Remaining Estimate: 0h
                  Time Spent: 10m