Commons Text / TEXT-78

Investigate adding a spell checker

Details

    Description

      There are already spell checkers such as Hunspell (which has its own API to define suffixes and prefixes), and simpler spell checkers like Jazzy (forked).

      Hunspell (GPL/mixed) is written in C++, and Java programs that use it normally rely on native access through BridJ or [JNA|https://github.com/dren-dk/HunspellJNA]. Jazzy (LGPL) is good but not actively maintained, and it simply looks up words from a dictionary.

      We can provide a simple interface, and perhaps one or two different spell checkers in pure Java.

      Since the project already has edit distances, we could investigate options such as a BK-tree or an in-memory look-up spell checker.
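
To make the BK-tree idea concrete, here is a minimal sketch, not a proposed implementation. The class name `BkTree` and its methods are hypothetical. The Levenshtein distance is written inline only to keep the example self-contained; in the project, one of the existing edit-distance classes could presumably be plugged in instead. Each node stores its children keyed by their distance to the node's word, so a search can skip whole subtrees using the triangle inequality.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical BK-tree sketch; not part of Commons Text. */
final class BkTree {

    private static final class Node {
        final String word;
        // Children keyed by their edit distance to this node's word.
        final Map<Integer, Node> children = new HashMap<>();

        Node(String word) {
            this.word = word;
        }
    }

    private Node root;

    /** Adds a word; a word already in the tree (distance 0) is ignored. */
    void add(String word) {
        if (root == null) {
            root = new Node(word);
            return;
        }
        Node current = root;
        while (true) {
            int d = distance(word, current.word);
            if (d == 0) {
                return;
            }
            Node child = current.children.get(d);
            if (child == null) {
                current.children.put(d, new Node(word));
                return;
            }
            current = child;
        }
    }

    /** Returns all stored words within maxDistance edits of the query. */
    List<String> search(String query, int maxDistance) {
        List<String> matches = new ArrayList<>();
        if (root != null) {
            collect(root, query, maxDistance, matches);
        }
        return matches;
    }

    private void collect(Node node, String query, int maxDistance, List<String> matches) {
        int d = distance(query, node.word);
        if (d <= maxDistance) {
            matches.add(node.word);
        }
        // Triangle inequality: only subtrees keyed in [d - max, d + max] can hold matches.
        for (Map.Entry<Integer, Node> entry : node.children.entrySet()) {
            int key = entry.getKey();
            if (key >= d - maxDistance && key <= d + maxDistance) {
                collect(entry.getValue(), query, maxDistance, matches);
            }
        }
    }

    /** Plain Levenshtein distance using two rows of dynamic programming. */
    static int distance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1), prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }
}
```

After adding a word list with `add`, a call such as `search("bok", 1)` returns the stored words at most one edit away from the query. The results come back in traversal order, so a real spell checker would probably still want to rank them.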

      When creating the initial interfaces and design, it will be important to remember that we want to support multiple languages too.
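
As one possible starting point for that discussion, the sketch below shows a small interface that carries its language as a `java.util.Locale`, plus the simplest in-memory look-up implementation. All names (`SpellChecker`, `DictionarySpellChecker`, `isCorrect`, `suggest`) are hypothetical and only illustrate the shape such an API might take; nothing here exists in Commons Text.

```java
import java.util.Collection;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

/** Hypothetical contract; every name here is illustrative only. */
interface SpellChecker {

    /** Language this checker was built for. */
    Locale locale();

    /** Returns true if the word is known to the checker. */
    boolean isCorrect(String word);

    /** Placeholder: a richer implementation could rank candidates by edit distance. */
    default List<String> suggest(String word, int maxSuggestions) {
        return Collections.emptyList();
    }
}

/** Simplest possible implementation: an in-memory, case-insensitive word set. */
final class DictionarySpellChecker implements SpellChecker {

    private final Locale locale;
    private final Set<String> words = new HashSet<>();

    DictionarySpellChecker(Locale locale, Collection<String> dictionary) {
        this.locale = locale;
        for (String w : dictionary) {
            // Case folding is locale-sensitive (for example, Turkish dotted and dotless i).
            words.add(w.toLowerCase(locale));
        }
    }

    @Override
    public Locale locale() {
        return locale;
    }

    @Override
    public boolean isCorrect(String word) {
        return words.contains(word.toLowerCase(locale));
    }
}
```

Tying the locale to the checker instance keeps language-specific behaviour, such as case folding, in one place. Whether one instance per language is the right granularity is an open design question.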

          People

            Assignee: Unassigned
            Reporter: Bruno P. Kinoshita (kinow)