• Type: Improvement Improvement
    • Status: Resolved
    • Priority: Trivial Trivial
    • Resolution: Fixed
    • Affects Version/s: 2.0.0
    • Fix Version/s: None
    • Component/s: general/javadocs
    • Labels:
    • Lucene Fields:
      New, Patch Available


      Added some javadocs that explains why the spellchecker does not work as one might expect it to.

      > Without having looked at the code for a long time, I think the problem is what the
      > lucene scoring consider to be best. First the grams are searched, resulting in a number
      > of hits. Then the edit-distance is calculated on each hit. "Genetics" is appearently the
      > third most similar hit according to Lucene, but the best according to Levenshtein.
      > I.e. Lucene does not use edit-distance as similarity. You need to get a bunch of best hits
      > in order to find the one with the smallest edit-distance.

      I took a look at the code, and my assessment seems to be right.


        Karl Wettin created issue -
        Karl Wettin made changes -
        Field Original Value New Value
        Attachment spellcheck_javadocs.diff [ 12349692 ]
        Otis Gospodnetic made changes -
        Assignee Otis Gospodnetic [ otis ]
        Otis Gospodnetic made changes -
        Status Open [ 1 ] Resolved [ 5 ]
        Lucene Fields [Patch Available, New] [New, Patch Available]
        Resolution Fixed [ 1 ]
        Mark Thomas made changes -
        Workflow jira [ 12395113 ] Default workflow, editable Closed status [ 12562236 ]
        Mark Thomas made changes -
        Workflow Default workflow, editable Closed status [ 12562236 ] jira [ 12583244 ]


          • Assignee:
            Otis Gospodnetic
            Karl Wettin
          • Votes:
            0 Vote for this issue
            0 Start watching this issue


            • Created: