Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-2673

CJKAnalyzer not matching mutlibyte character followed by non-multibyte character

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.0.1
    • None
    • modules/analysis
    • None
    • New

    Description

      Here is a listing of text indexed in a field, followed by various search terms that did or did not match the document.

      [QES様文字化けテスト]
      QES -> retrievable
      QES様 -> not retrievable
      QES様文字化けテスト -> retrievable

      [SOA基盤]
      SOA ->retrievable
      SOA基 -> not retrievable
      SOA基盤 -> retrievable

      [日経BP]
      日経 -> retrievable
      日経B -> not retrievable
      日経BP -> retrievable

      Attachments

        Activity

          People

            Unassigned Unassigned
            kphayen Kevin Hayen
            Votes:
            1 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated: