Issue Details (XML | Word | Printable)

Key: LUCENE-1183
Type: Improvement Improvement
Status: Closed Closed
Resolution: Fixed
Priority: Minor Minor
Assignee: Otis Gospodnetic
Reporter: Cédrik LIME
Votes: 0
Watchers: 1
Operations

If you were logged in you would be able to see more operations.
Lucene - Java

TRStringDistance uses way too much memory (with patch)

Created: 21/Feb/08 11:18 AM   Updated: Wednesday 04:47 PM
Return to search
Component/s: contrib/*
Affects Version/s: 1.9, 2.0.0, 2.1, 2.2, 2.3
Fix Version/s: 3.0

Time Tracking:
Original Estimate: 0.17h
Original Estimate - 0.17h
Remaining Estimate: 0.17h
Remaining Estimate - 0.17h
Time Spent: Not Specified
Remaining Estimate - 0.17h

File Attachments:
  Size
Text File Licensed for inclusion in ASF works FuzzyTermEnum.patch 2008-02-21 03:27 PM Cédrik LIME 7 kB
Java Source File Licensed for inclusion in ASF works TRStringDistance.java 2008-02-21 11:22 AM Cédrik LIME 4 kB
Text File Licensed for inclusion in ASF works TRStringDistance.patch 2008-02-21 11:22 AM Cédrik LIME 6 kB
Issue Links:
Reference

Lucene Fields: Patch Available, New
Resolution Date: 20/Oct/09 09:23 PM


 Description  « Hide
The implementation of TRStringDistance is based on version 2.1 of org.apache.commons.lang.StringUtils#getLevenshteinDistance(String, String), which uses an un-optimized implementation of the Levenshtein Distance algorithm (it uses way too much memory). Please see Bug 38911 (http://issues.apache.org/bugzilla/show_bug.cgi?id=38911) for more information.

The commons-lang implementation has been heavily optimized as of version 2.2 (3x speed-up). I have reported the new implementation to TRStringDistance.



 All   Comments   Work Log   Change History   Subversion Commits      Sort Order: Ascending order - Click to sort in descending order
Repository Revision Date User Message
ASF #659016 Thu May 22 06:24:55 UTC 2008 otis LUCENE-1183: Optimized TRStringDistance class (in contrib/spell) that uses less memory than the previous version
Files Changed
MODIFY /lucene/java/trunk/contrib/spellchecker/src/java/org/apache/lucene/search/spell/TRStringDistance.java
MODIFY /lucene/java/trunk/CHANGES.txt

Repository Revision Date User Message
ASF #827800 Tue Oct 20 21:22:52 UTC 2009 mikemccand LUCENE-1183: optimize Levenshtein distance computation in FuzzyQuery
Files Changed
MODIFY /lucene/java/trunk/src/java/org/apache/lucene/search/FuzzyTermEnum.java
MODIFY /lucene/java/trunk/CHANGES.txt