  1. Lucene - Core
  2. LUCENE-3976

Improve error messages for unsupported Hunspell formats

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Won't Fix
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: modules/analysis
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      Our Hunspell implementation is never going to be able to support the huge variety of formats that are out there, especially since it is based on papers written on the topic rather than being a pure port.

      Recently we ran into the following suffix rule:

      SFX CA 0 /CaCp

      Because the rule is missing its regex condition, an AOE was thrown, which made the problem difficult to diagnose.

      We should instead try to provide better error messages showing what we were unable to parse.
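
      As a rough illustration of the kind of message meant here, the following is a minimal sketch, not the actual Lucene code or the attached patch. It assumes that a rule line must contain at least five whitespace-separated fields (option, flag, strip, affix, condition). The class AffixRuleCheck and the method conditionOf are hypothetical names chosen for this example.

      ```java
      // Hypothetical helper, not part of Lucene: validates one affix rule line
      // before the condition field is read.
      final class AffixRuleCheck {

          // Assumption for this sketch: option, flag, strip, affix, condition.
          static final int MIN_FIELDS = 5;

          /**
           * Returns the condition field of a rule line, or throws an exception
           * that names the line number and echoes the text that could not be parsed.
           */
          static String conditionOf(String line, int lineNumber) {
              String[] fields = line.trim().split("\\s+");
              if (fields.length < MIN_FIELDS) {
                  throw new IllegalArgumentException(
                      "Unsupported affix rule at line " + lineNumber
                          + ": expected at least " + MIN_FIELDS
                          + " fields but found " + fields.length
                          + " in \"" + line + "\"");
              }
              return fields[MIN_FIELDS - 1];
          }

          public static void main(String[] args) {
              try {
                  // The rule from this issue has only four fields.
                  conditionOf("SFX CA 0 /CaCp", 1);
              } catch (IllegalArgumentException e) {
                  System.out.println(e.getMessage());
              }
          }
      }
      ```

      Checking the field count up front turns an index failure deep in the parser into a message that points at the offending line of the affix file.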

        Attachments

        1. LUCENE-3976.patch
          1 kB
          Luca Cavanna
        2. LUCENE-3976.patch
          4 kB
          Luca Cavanna

            People

            • Assignee:
              cmale Chris Male
              Reporter:
              cmale Chris Male
            • Votes:
              0
              Watchers:
              2