Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-3888

split off the spell check word and surface form in spell check dictionary

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: 4.9, 6.0
    • Component/s: modules/spellchecker
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      The "did you mean?" feature by using Lucene's spell checker cannot work well for Japanese environment unfortunately and is the longstanding problem, because the logic needs comparatively long text to check spells, but for some languages (e.g. Japanese), most words are too short to use the spell checker.

      I think, for at least Japanese, the things can be improved if we split off the spell check word and surface form in the spell check dictionary. Then we can use ReadingAttribute for spell checking but CharTermAttribute for suggesting, for example.

        Attachments

        1. LUCENE-3888.patch
          4 kB
          Koji Sekiguchi
        2. LUCENE-3888.patch
          6 kB
          Robert Muir
        3. LUCENE-3888.patch
          6 kB
          Robert Muir
        4. LUCENE-3888.patch
          14 kB
          Koji Sekiguchi
        5. LUCENE-3888.patch
          16 kB
          Robert Muir
        6. LUCENE-3888.patch
          26 kB
          Robert Muir

          Activity

            People

            • Assignee:
              koji Koji Sekiguchi
              Reporter:
              koji Koji Sekiguchi
            • Votes:
              2 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated: