Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-3888

split off the spell check word and surface form in spell check dictionary

Details

    • Improvement
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • None
    • 4.9, 6.0
    • modules/spellchecker
    • None
    • New

    Description

      The "did you mean?" feature by using Lucene's spell checker cannot work well for Japanese environment unfortunately and is the longstanding problem, because the logic needs comparatively long text to check spells, but for some languages (e.g. Japanese), most words are too short to use the spell checker.

      I think, for at least Japanese, the things can be improved if we split off the spell check word and surface form in the spell check dictionary. Then we can use ReadingAttribute for spell checking but CharTermAttribute for suggesting, for example.

      Attachments

        1. LUCENE-3888.patch
          4 kB
          Koji Sekiguchi
        2. LUCENE-3888.patch
          6 kB
          Robert Muir
        3. LUCENE-3888.patch
          6 kB
          Robert Muir
        4. LUCENE-3888.patch
          14 kB
          Koji Sekiguchi
        5. LUCENE-3888.patch
          16 kB
          Robert Muir
        6. LUCENE-3888.patch
          26 kB
          Robert Muir

        Activity

          People

            koji Koji Sekiguchi
            koji Koji Sekiguchi
            Votes:
            2 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: