  1. Lucene - Core
  2. LUCENE-11

QueryParser not recognizing asterisk with UTF-8 index

Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Incomplete
    • Component/s: core/queryparser
    • Operating System: Linux
      Platform: All

    • 6091

    Description

      Version: 1.2-RC3

      I've created an index of UTF-8 encoded documents and am making sure that all
      queries are converted to UTF-8. When searching the index with a query containing
      non-ASCII UTF-8 characters and an asterisk, no results are found even though
      there are documents that contain the query word. Searching always works when
      the query contains no non-ASCII characters or no asterisk.
      Test results with Swedish words:
      "födde" - works ok, returns documents.
      "född*" - doesn't return any results.
      "född" - works ok, returns documents.
      "kom*" - works ok, returns documents.
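
When narrowing down a report like this, it can help to confirm which characters the query string actually contains before it is handed to QueryParser. The following is a minimal diagnostic sketch using only the Java standard library. It does not reproduce the bug and makes no claim about its cause. The class name QueryEncodingCheck and the helper codeUnits are hypothetical names introduced for illustration. It shows how the UTF-8 bytes of "född*" look when decoded as UTF-8 and when decoded as ISO-8859-1 by mistake.

```java
import java.nio.charset.StandardCharsets;

public class QueryEncodingCheck {

    // Renders each UTF-16 code unit of the string as four hex digits,
    // separated by spaces, so that invisible encoding damage becomes visible.
    static String codeUnits(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            if (i > 0) {
                sb.append(' ');
            }
            sb.append(String.format("%04x", (int) s.charAt(i)));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // "född*" written with an escape so the source file encoding does not matter.
        String query = "f\u00f6dd*";
        byte[] utf8 = query.getBytes(StandardCharsets.UTF_8);

        // Decoding the bytes with the charset they were encoded in restores the query.
        String decodedAsUtf8 = new String(utf8, StandardCharsets.UTF_8);

        // Decoding the same bytes as ISO-8859-1 turns the two-byte "ö" into two characters.
        String decodedAsLatin1 = new String(utf8, StandardCharsets.ISO_8859_1);

        System.out.println(codeUnits(decodedAsUtf8));   // 0066 00f6 0064 0064 002a
        System.out.println(codeUnits(decodedAsLatin1)); // 0066 00c3 00b6 0064 0064 002a
    }
}
```

If the query string passed to the parser shows the second pattern while the indexed terms show the first (or the reverse), the query and the index disagree about encoding, which is worth ruling out before looking at wildcard handling itself.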

      Attachments

        1. ASF.LICENSE.NOT.GRANTED--patch8.txt
          6 kB
          Jean-François Halleux


          People

            Assignee: java-dev@lucene.apache.org Lucene Developers
            Reporter: tero@favorin.com Tero Favorin