Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-9250

Search breaks with EU symbol € and wildcard *

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 5.3.1
    • None
    • Server
    • None

    Description

      While testing UTF-8 character searches, which worked, we have noticed a combination that fails. Testing with the data Tùûüÿ€àâæçéèêëïîôœm, we found the search worked, but by adding a wild-card (e.g. Tùûüÿ€àâæçéèêëïîôœm*), the search fails. Adding the wildcard before the symbol worked (i.e. Tùûüÿ*).

      Showing the logs for these queries:

      Full text without wildcard, hit=1
      2016-06-25 13:16:34.361 [qtp237852351-21] INFO  org.apache.solr.core.SolrCore.Request  – [core-name] webapp=/solr path=/select params={q=Tùûüÿ€àâæçéèêëïîôœm&indent=true&fq=type:CONTACT&rows=12&wt=json&_=1466860594348} hits=1 status=0 QTime=0 
      
      Full text with wildcard, hit=0
      2016-06-25 13:16:41.172 [qtp237852351-16] INFO  org.apache.solr.core.SolrCore.Request  – [core-name] webapp=/solr path=/select params={q=Tùûüÿ€àâæçéèêëïîôœm*&indent=true&fq=type:CONTACT&rows=12&wt=json&_=1466860601160} hits=0 status=0 QTime=0 
      
      Partial text before € with wildcard, hit=1
      2016-06-25 13:16:52.135 [qtp237852351-18] INFO  org.apache.solr.core.SolrCore.Request  – [core-name] webapp=/solr path=/select params={q=Tùûüÿ*&indent=true&fq=type:CONTACT&rows=12&wt=json&_=1466860612125} hits=1 status=0 QTime=2 
      

      Attachments

        1. contact-name-analyze.png
          78 kB
          Tim Nolan
        2. contact-name-field-type.png
          36 kB
          Tim Nolan

        Activity

          People

            Unassigned Unassigned
            timn Tim Nolan
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated: