Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-9687

Hunspell support improvements

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 9.0, 8.9
    • None
    • None
    • New

    Description

      I'd like Lucene's Hunspell support to be on a par with the native C++ Hunspell for spellchecking and suggestions, at least for some languages. So I propose to:

      • support the affix rules necessary for English, German, French, Spanish and
        Russian dictionaries, possibly more languages later
      • mirror Hunspell's suggestion algorithm in Lucene
      • provide a public APIs for spellchecking, suggestion, stemming, morphological data
      • check corpora for specific languages to find and fix spellchecking/suggestion discrepancices between Lucene's implementation and Hunspell/C++

      Attachments

        There are no Sub-Tasks for this issue.

        Activity

          People

            Unassigned Unassigned
            Gromov Peter Gromov
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 105.5h
                105.5h