  1. Lucene - Core
  2. LUCENE-5824

hunspell FLAG LONG implemented incorrectly

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Fix Version/s: 4.10, 6.0

    Description

      If you have more than 256 flags, you run out of 8-bit characters, so you have to use another flag type to get up to 64k flags:

      • UTF-8: 16-bit BMP flags
      • long: two-character flags like 'AB'
      • num: decimal numbers like '10234'

      But our implementation for 'long' is wrong: it encodes the flag as 'A+B' (the sum of the two characters), which means it can't distinguish between 'AB' and 'BA'. Distinct flags collapse to the same value, and that causes overgeneration.
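
      To make the collision concrete, here is a minimal sketch. It is not the code from the attached patch; the class and method names are hypothetical, and the packed encoding is just one order-preserving alternative, assuming both flag characters fit in 8 bits.

```java
public class LongFlagSketch {

    // Buggy scheme described above: the two characters are summed,
    // so the order of the characters is lost.
    public static char encodeBySum(String flag) {
        checkLength(flag);
        return (char) (flag.charAt(0) + flag.charAt(1));
    }

    // One order-preserving alternative (an assumption, not taken from the patch):
    // first character in the high byte, second character in the low byte.
    public static char encodePacked(String flag) {
        checkLength(flag);
        char first = flag.charAt(0);
        char second = flag.charAt(1);
        if (first > 0xFF || second > 0xFF) {
            throw new IllegalArgumentException("expected 8-bit characters: " + flag);
        }
        return (char) ((first << 8) | second);
    }

    private static void checkLength(String flag) {
        if (flag.length() != 2) {
            throw new IllegalArgumentException("long flags have exactly two characters: " + flag);
        }
    }

    public static void main(String[] args) {
        // Summing collides for swapped characters, and also for unrelated pairs.
        System.out.println(encodeBySum("AB") == encodeBySum("BA"));   // true
        System.out.println(encodeBySum("AD") == encodeBySum("BC"));   // true
        // Packing keeps the two flags distinct.
        System.out.println(encodePacked("AB") == encodePacked("BA")); // false
    }
}
```

      With the summed encoding, an affix rule flagged 'AB' would also apply to any word flagged 'BA', which is the overgeneration described above.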

      Attachments

        1. LUCENE-5824.patch
          2 kB
          Robert Muir

          People

            Assignee: Unassigned
            Reporter: Robert Muir (rcmuir)