Details
-
Improvement
-
Status: Closed
-
Minor
-
Resolution: Fixed
-
1.9, 2.0.0, 2.1, 2.2, 2.3
-
None
-
New, Patch Available
Description
The implementation of TRStringDistance is based on version 2.1 of org.apache.commons.lang.StringUtils#getLevenshteinDistance(String, String), which uses an un-optimized implementation of the Levenshtein Distance algorithm (it uses way too much memory). Please see Bug 38911 (http://issues.apache.org/bugzilla/show_bug.cgi?id=38911) for more information.
The commons-lang implementation has been heavily optimized as of version 2.2 (3x speed-up). I have reported the new implementation to TRStringDistance.
Attachments
Attachments
Issue Links
- is related to
-
LUCENE-691 Bob Carpenter's FuzzyTermEnum refactoring
- Resolved
- relates to
-
LUCENE-691 Bob Carpenter's FuzzyTermEnum refactoring
- Resolved