  1. Lucene - Core
  2. LUCENE-3375

processing a synonym in a token stream will remove the following token from the stream

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.4, 4.0-ALPHA
    • Fix Version/s: 3.4, 4.0-ALPHA
    • Component/s: modules/analysis
    • Labels: None

    Description

      If you run a phrase search against a field whose fieldtype uses the synonym filter, and the phrase contains a term that has a synonym, the term immediately following that synonym vanishes after synonym expansion.

      e.g. http://host:port/solr/corename/select/?q=desc:%22xyzzy%20%20bbb%20pot%20of%20gold%22&version=2.2&start=0&rows=10&indent=on&debugQuery=true (bbb is in the default synonyms file, desc is a "text" fieldtype)

      outputs the following (note that "pot" is missing from the parsed query):
      ....
      <str name="rawquerystring">desc:"xyzzy bbb pot of gold"</str>
      <str name="querystring">desc:"xyzzy bbb pot of gold"</str>
      <str name="parsedquery">PhraseQuery(desc:"xyzzy bbbb 1 bbbb 2 of gold")</str>
      <str name="parsedquery_toString">desc:"xyzzy bbbb 1 bbbb 2 of gold"</str>
      ....

      You can also see this behavior on the admin console's analysis.jsp page.

      Solr 3.3 behaves properly.
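
To make the symptom concrete, here is a minimal toy model of what synonym expansion is expected to do to the phrase, plus a check that finds the term missing from the reported parsed query. This is only a sketch and not Lucene's SynonymFilter: the `SYNONYMS` mapping and the helper names `expand` and `dropped_terms` are assumptions made for illustration, and the "observed" terms are copied from the debug output above.

```python
# Toy model only; this is NOT Lucene's SynonymFilter implementation.
# ASSUMPTION: "bbb" maps to two replacement tokens. The exact rule in the
# synonyms file is not shown in this report, so this mapping is hypothetical.
SYNONYMS = {"bbb": ["bbbb1", "bbbb2"]}


def expand(tokens, synonyms):
    """Replace each token that has a synonym rule; keep all other tokens."""
    out = []
    for tok in tokens:
        out.extend(synonyms.get(tok, [tok]))
    return out


def dropped_terms(original, observed, synonyms):
    """Original terms that are not synonym keys yet are absent from the output."""
    return [t for t in original if t not in synonyms and t not in observed]


query = "xyzzy bbb pot of gold".split()

# What a correct expansion should preserve: every non-synonym term survives.
expected = expand(query, SYNONYMS)

# Terms taken from the parsedquery in the debug output of this report.
observed = "xyzzy bbbb 1 bbbb 2 of gold".split()

print(expected)
print(dropped_terms(query, observed, SYNONYMS))
```

The second print reports the term that follows the synonym in the original phrase, which matches the behavior described in this issue.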

      Attachments

        1. SOLR-2709_test.patch
          2 kB
          Robert Muir
        2. LUCENE-3375_test.patch
          2 kB
          Robert Muir
        3. LUCENE-3375.patch
          8 kB
          Michael McCandless
        4. LUCENE-3375.patch
          16 kB
          Robert Muir

          People

            Assignee: Unassigned
            Reporter: Simon Rosenthal