Issue Details (XML | Word | Printable)

Key: LUCENE-1183
Type: Improvement Improvement
Status: Closed Closed
Resolution: Fixed
Priority: Minor Minor
Assignee: Otis Gospodnetic
Reporter: Cédrik LIME
Votes: 0
Watchers: 1
Operations

If you were logged in you would be able to see more operations.
Lucene - Java

TRStringDistance uses way too much memory (with patch)

Created: 21/Feb/08 11:18 AM   Updated: 25/Nov/09 04:47 PM
Return to search
Component/s: contrib/*
Affects Version/s: 1.9, 2.0.0, 2.1, 2.2, 2.3
Fix Version/s: 3.0

Time Tracking:
Original Estimate: 0.17h
Original Estimate - 0.17h
Remaining Estimate: 0.17h
Remaining Estimate - 0.17h
Time Spent: Not Specified
Remaining Estimate - 0.17h

File Attachments:
  Size
Text File Licensed for inclusion in ASF works FuzzyTermEnum.patch 2008-02-21 03:27 PM Cédrik LIME 7 kB
Java Source File Licensed for inclusion in ASF works TRStringDistance.java 2008-02-21 11:22 AM Cédrik LIME 4 kB
Text File Licensed for inclusion in ASF works TRStringDistance.patch 2008-02-21 11:22 AM Cédrik LIME 6 kB
Issue Links:
Reference

Lucene Fields: Patch Available, New
Resolution Date: 20/Oct/09 09:23 PM


 Description  « Hide
The implementation of TRStringDistance is based on version 2.1 of org.apache.commons.lang.StringUtils#getLevenshteinDistance(String, String), which uses an un-optimized implementation of the Levenshtein Distance algorithm (it uses way too much memory). Please see Bug 38911 (http://issues.apache.org/bugzilla/show_bug.cgi?id=38911) for more information.

The commons-lang implementation has been heavily optimized as of version 2.2 (3x speed-up). I have reported the new implementation to TRStringDistance.



 All   Comments   Work Log   Change History   Subversion Commits      Sort Order: Ascending order - Click to sort in descending order
No work has yet been logged on this issue.