Uploaded image for project: 'Lucene.Net'
  1. Lucene.Net
  2. LUCENENET-366

Spellchecker issues

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • None
    • None

    Description

      There are several issues with the spellchecker:

      • It doesn't do duplicate checking across updates (so the same word is often indexed many, many times)
      • The n-gram fields are stored as well as indexed, which increases the size of the index by several orders of magnitude and provides no benefit
      • Some deprecated functions are used, which slows it down
      • Some methods aren't commented fully

      I will attach a patch that fixes these.

      Attachments

        1. LUCENENET-366.patch
          2 kB
          Ben West
        2. LUCENENET-366-spellcheck29.patch
          43 kB
          Ben West
        3. LuceneNet-SpellcheckFixes.patch
          9 kB
          Ben West
        4. spellcheck-2.9-upgrade.patch
          96 kB
          Ben West
        5. spellcheck-29.patch
          79 kB
          Ben West

        Activity

          People

            digydigy Digy
            xodarap Ben West
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: