Lucene - Core
  1. Lucene - Core
  2. LUCENE-1183

TRStringDistance uses way too much memory (with patch)

    Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Minor Minor
    • Resolution: Fixed
    • Affects Version/s: 1.9, 2.0.0, 2.1, 2.2, 2.3
    • Fix Version/s: 3.0
    • Component/s: modules/other
    • Labels:
      None
    • Lucene Fields:
      New, Patch Available

      Description

      The implementation of TRStringDistance is based on version 2.1 of org.apache.commons.lang.StringUtils#getLevenshteinDistance(String, String), which uses an un-optimized implementation of the Levenshtein Distance algorithm (it uses way too much memory). Please see Bug 38911 (http://issues.apache.org/bugzilla/show_bug.cgi?id=38911) for more information.

      The commons-lang implementation has been heavily optimized as of version 2.2 (3x speed-up). I have reported the new implementation to TRStringDistance.

      1. LUCENE-1183_3x.patch
        2 kB
        Robert Muir
      2. LUCENE-1183.patch
        7 kB
        Robert Muir
      3. FuzzyTermEnum.patch
        7 kB
        Cédrik LIME
      4. TRStringDistance.java
        4 kB
        Cédrik LIME
      5. TRStringDistance.patch
        6 kB
        Cédrik LIME

        Issue Links

          Activity

          Cédrik LIME created issue -
          Cédrik LIME made changes -
          Field Original Value New Value
          Attachment TRStringDistance.java [ 12376117 ]
          Attachment TRStringDistance.patch [ 12376116 ]
          Cédrik LIME made changes -
          Attachment FuzzyTermEnum.patch [ 12376124 ]
          Grant Ingersoll made changes -
          Priority Major [ 3 ] Minor [ 4 ]
          Lucene Fields [Patch Available, New] [New, Patch Available]
          Cédrik LIME made changes -
          Attachment FuzzyTermEnum.patch [ 12376129 ]
          Cédrik LIME made changes -
          Attachment FuzzyTermEnum.patch [ 12376124 ]
          Karl Wettin made changes -
          Link This issue is related to LUCENE-691 [ LUCENE-691 ]
          Grant Ingersoll made changes -
          Link This issue relates to LUCENE-691 [ LUCENE-691 ]
          Otis Gospodnetic made changes -
          Assignee Otis Gospodnetic [ otis ]
          Michael McCandless made changes -
          Status Open [ 1 ] Resolved [ 5 ]
          Fix Version/s 3.0 [ 12312889 ]
          Resolution Fixed [ 1 ]
          Uwe Schindler made changes -
          Status Resolved [ 5 ] Closed [ 6 ]
          Robert Muir made changes -
          Resolution Fixed [ 1 ]
          Status Closed [ 6 ] Reopened [ 4 ]
          Assignee Otis Gospodnetic [ otis ] Robert Muir [ rcmuir ]
          Robert Muir made changes -
          Attachment LUCENE-1183.patch [ 12451501 ]
          Robert Muir made changes -
          Attachment LUCENE-1183_3x.patch [ 12451504 ]
          Robert Muir made changes -
          Status Reopened [ 4 ] Resolved [ 5 ]
          Resolution Fixed [ 1 ]
          Mark Thomas made changes -
          Workflow jira [ 12424100 ] Default workflow, editable Closed status [ 12562506 ]
          Mark Thomas made changes -
          Workflow Default workflow, editable Closed status [ 12562506 ] jira [ 12583475 ]
          Uwe Schindler made changes -
          Status Resolved [ 5 ] Closed [ 6 ]

            People

            • Assignee:
              Robert Muir
              Reporter:
              Cédrik LIME
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Time Tracking

                Estimated:
                Original Estimate - 10m
                10m
                Remaining:
                Remaining Estimate - 10m
                10m
                Logged:
                Time Spent - Not Specified
                Not Specified

                  Development