  1. Solr
  2. SOLR-13009

ExtendedSolrQueryParser cannot handle the combination of KeywordRepeat, synonyms and minShouldMatch

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 7.5
    • Fix Version/s: master (9.0), 8.2
    • Component/s: query parsers
    • Labels:
      None

      Description

      As described on the mailing list [1].

      Consider this example synonym set [bier,brouw] where bier means beer and brouw is the stemmed version of brouwsel (brewage, concoction), and consider these parameters on /select:

      qf=content_nl&defType=edismax&mm=2<-1 5<-2 6<90%25

      The queries q=bier and q=brouw both give the desired results. However, for q=brouwsel something (partially) unexpected happens:

      +(((content_nl:brouwsel Synonym(content_nl:bier content_nl:brouw))~2))

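      This results in a BooleanQuery with two optional clauses and a minimum-should-match of 2 (the ~2 above), because the first rule of the mm spec, 2<-1, requires all clauses to match when a query has one or two of them. Both clauses therefore need to match, giving very few results. Removing KeywordRepeat or setting mm=1 of course fixes the problem, but that is not what we want.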
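To make the mm arithmetic concrete, here is a minimal sketch of how a conditional mm spec such as the one above could be evaluated for a given number of optional clauses. It is an illustration written from the documented mm semantics, not Solr's own code: the class MmSketch and its methods are hypothetical names, and the spec is written with a plain % rather than the URL-encoded %25.

```java
public class MmSketch {

    // Hypothetical helper (not Solr's implementation): returns how many of
    // clauseCount optional clauses must match under the given mm spec.
    static int minShouldMatch(int clauseCount, String spec) {
        int result = clauseCount;
        spec = spec.trim();
        if (spec.contains("<")) {
            // Conditional spec: whitespace-separated "bound<value" pairs,
            // assumed to be listed with ascending bounds.
            for (String part : spec.split("\\s+")) {
                String[] pair = part.split("<");
                int bound = Integer.parseInt(pair[0]);
                if (clauseCount <= bound) {
                    // At or below this bound: keep the value from the previous
                    // rule, or require every clause if no rule applied yet.
                    return result;
                }
                result = simple(clauseCount, pair[1]);
            }
            return result;
        }
        return simple(clauseCount, spec);
    }

    // Evaluates a single value: an integer or a percentage, either of which
    // may be negative, meaning "all clauses minus this many".
    static int simple(int clauseCount, String value) {
        int result;
        if (value.endsWith("%")) {
            int percent = Integer.parseInt(value.substring(0, value.length() - 1));
            int share = clauseCount * percent / 100; // integer division truncates toward zero
            result = percent < 0 ? clauseCount + share : share;
        } else {
            int n = Integer.parseInt(value);
            result = n < 0 ? clauseCount + n : n;
        }
        // Clamp to the valid range [0, clauseCount].
        return Math.max(0, Math.min(clauseCount, result));
    }

    public static void main(String[] args) {
        String spec = "2<-1 5<-2 6<90%";
        for (int n = 1; n <= 7; n++) {
            System.out.println(n + " clause(s) -> " + minShouldMatch(n, spec));
        }
    }
}
```

Under these assumptions a single clause yields 1, while two clauses yield 2. The one user term brouwsel is expanded into two clauses, so it is treated like a two-term query in which every clause is mandatory.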

      I will shortly attach a patch containing a unit test that demonstrates the problem.

      [1] http://lucene.472066.n3.nabble.com/KeywordRepeat-stemming-single-term-synonyms-and-minimum-should-match-edismax-td4413690.html

        Attachments

        1. SOLR-13009-unittest.patch
          6 kB
          Markus Jelsma

            People

            • Assignee:
              Unassigned
            • Reporter:
              Markus Jelsma (markus17)