Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-5505

hunspell SET/FLAG whitespace/BOM handling

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 4.8, 6.0
    • Component/s: None
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      Several dictionaries cannot be loaded today (Armenian, Papiamento, Macedonian, Russian, Urdu) because they have stuff like SET<tab>UTF-8, FLAG<space><space>UTF-8 or have a BOM marker on the first line (or even combinations of these).

      Also because SET need not be the first line in the file, we should ignore BOM markers on the first line in general (e.g. it might be something else like FLAG).

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              rcmuir Robert Muir
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: