Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-8480

Completion regex query uses UTF32 automaton

Details

    • Bug
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • None
    • None
    • None
    • None
    • New

    Description

      The completion regex query builds an UTF-32 automaton but the completion FST uses UTF-8 internally. This makes the matching of any non basic latin character impossible in a regex completion query.

      Attachments

        Activity

          People

            Unassigned Unassigned
            jim.ferenczi Jim Ferenczi
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: