Uploaded image for project: 'Commons Text'
  1. Commons Text
  2. TEXT-78

Investigate adding a spell checker

    XMLWordPrintableJSON

    Details

      Description

      There are already spell checkers such as Hunspell (which has its own API to define suffixes and prefix), and simpler spell checkers like jazzy (forked).

      Hunspell (GPL/mixed) is written in C++, and Java programs that use it normally use native access through BridJ/[JNA|https://github.com/dren-dk/HunspellJNA]. Jazzy (LGPL) is good, but not actively maintained, and simply looks up words from a dictionary.

      We can provide a simple interface, and perhaps one or two different spell checkers in pure Java.

      Having edit distances already in the project, we could investigate something such as adding a BK-Tree, an in-memory look-up spell checker, etc.

      When creating the initial interfaces & design, it will be important to remember that we want to support multiple languages too.

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              kinow Bruno P. Kinoshita
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: