Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-3375

processing a synonym in a token stream will remove the following token from the stream

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.4, 4.0-ALPHA
    • Fix Version/s: 3.4, 4.0-ALPHA
    • Component/s: modules/analysis
    • Labels:
      None
    • Environment:

      Description

      If you do a phrase search on a field derived from a fieldtype with the synonym filter which includes a synonym, the term following the synonym vanishes after synonym expansion.

      e.g. http://host:port/solr/corename/select/?q=desc:%22xyzzy%20%20bbb%20pot%20of%20gold%22&version=2.2&start=0&rows=10&indent=on&debugQuery=true (bbb is in the default synonyms file, desc is a "text" fieldtype)

      outputs
      ....
      <str name="rawquerystring">desc:"xyzzy bbb pot of gold"</str>
      <str name="querystring">desc:"xyzzy bbb pot of gold"</str>
      <str name="parsedquery">PhraseQuery(desc:"xyzzy bbbb 1 bbbb 2 of gold")</str>
      <str name="parsedquery_toString">desc:"xyzzy bbbb 1 bbbb 2 of gold"</str>
      ....

      You can also see this behavior using the admin console analysis.jsp

      Solr 3.3 behaves properly.

        Attachments

        1. LUCENE-3375.patch
          16 kB
          Robert Muir
        2. LUCENE-3375.patch
          8 kB
          Michael McCandless
        3. LUCENE-3375_test.patch
          2 kB
          Robert Muir
        4. SOLR-2709_test.patch
          2 kB
          Robert Muir

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              simon.rosenthal Simon Rosenthal
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: