  1. Solr
  2. SOLR-13009

ExtendedSolrQueryParser cannot handle KeywordRepeat, synonyms and minShouldMatch together


Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 7.5
    • Fix Version/s: 8.2
    • Component/s: query parsers
    • Labels: None

    Description

      As described on the mailing list [1].

      Consider the example synonym set [bier,brouw], where bier means beer and brouw is the stemmed form of brouwsel (brewage, concoction), together with these parameters on /select:

      qf=content_nl&defType=edismax&mm=2<-1 5<-2 6<90%25
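
      To reproduce this, the parameters have to be URL-encoded, which is why the trailing percent sign of the mm value appears as %25 above. The following is a minimal sketch of building such a request URL. The host, port and core name are hypothetical placeholders, and debugQuery=true is added only so that the response includes the parsed query.

```python
from urllib.parse import urlencode, parse_qs

# Hypothetical host and core name; adjust to your own installation.
SOLR_SELECT = "http://localhost:8983/solr/mycore/select"


def build_select_url(q: str) -> str:
    """Build a /select URL with the edismax parameters from this report."""
    params = {
        "q": q,
        "qf": "content_nl",
        "defType": "edismax",
        "mm": "2<-1 5<-2 6<90%",   # raw value; urlencode turns % into %25
        "debugQuery": "true",      # ask Solr to return the parsed query
    }
    return SOLR_SELECT + "?" + urlencode(params)


url = build_select_url("brouwsel")
```

      Fetching the resulting URL for q=bier, q=brouw and q=brouwsel and comparing the parsed queries in the debug output should show the difference described here.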

      The queries q=bier and q=brouw both give the desired results. However, q=brouwsel produces a (partially) unexpected parsed query:

      +(((content_nl:brouwsel Synonym(content_nl:bier content_nl:brouw))~2))

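      This results in a BooleanQuery with two optional clauses. Because the mm specification requires all clauses to match when there are two or fewer, the effective minimum-should-match is 2 (the ~2 above), so both clauses must match and very few documents are returned. Removing KeywordRepeat or setting mm=1 fixes the problem, but neither is what we want.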
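
      To make the arithmetic explicit, here is a simplified sketch of how a conditional mm specification resolves for a given number of optional clauses. It follows the documented semantics of the mm parameter but is not Solr's actual code. The function name min_should_match is hypothetical, and the sketch assumes a well-formed spec with no spaces around "<".

```python
def min_should_match(optional_clauses: int, spec: str) -> int:
    """Resolve an mm spec to a clause count (simplified, illustrative only)."""
    result = optional_clauses
    spec = spec.strip()

    if "<" in spec:
        # Conditional spec: "N<expr" applies only when there are more
        # than N optional clauses; later conditions override earlier ones.
        for condition in spec.split():
            bound, _, expr = condition.partition("<")
            if optional_clauses <= int(bound):
                return result
            result = min_should_match(optional_clauses, expr)
        return result

    if spec.endswith("%"):
        # Percentage of the clause count, truncated toward zero.
        calc = int(optional_clauses * int(spec[:-1]) / 100)
    else:
        calc = int(spec)

    # A negative value means "all clauses minus that many".
    result = optional_clauses + calc if calc < 0 else calc
    return max(0, min(result, optional_clauses))


MM = "2<-1 5<-2 6<90%"          # the decoded mm value from this report
required = min_should_match(2, MM)  # two clauses, as in the parsed query
```

      With two optional clauses, as in the parsed query for q=brouwsel, the first condition does not apply and both clauses remain required. A single-clause query such as q=bier requires only that one clause.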

      I will shortly attach a patch containing a unit test that demonstrates the problem.

      [1] http://lucene.472066.n3.nabble.com/KeywordRepeat-stemming-single-term-synonyms-and-minimum-should-match-edismax-td4413690.html

      Attachments

        1. SOLR-13009-unittest.patch
          6 kB
          Markus Jelsma

          People

            Assignee: Unassigned
            Reporter: Markus Jelsma (markus17)