Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-4542

Make RECURSION_CAP in HunspellStemmer configurable

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 4.0
    • Fix Version/s: 4.4, 6.0
    • Component/s: modules/analysis
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      Currently there is
      private static final int RECURSION_CAP = 2;
      in the code of the class HunspellStemmer. It makes using hunspell with several dictionaries almost unusable, due to bad performance (f.ex. it costs 36ms to stem long sentence in latvian for recursion_cap=2 and 5 ms for recursion_cap=1). It would be nice to be able to tune this number as needed.
      AFAIK this number (2) was chosen arbitrary.

      (it's a first issue in my life, so please forgive me any mistakes done).

        Attachments

        1. Lucene-4542-javadoc.patch
          8 kB
          Rafał Kuć
        2. LUCENE-4542-with-solr.patch
          7 kB
          Rafał Kuć
        3. LUCENE-4542.patch
          2 kB
          Rafał Kuć

          Issue Links

            Activity

              People

              • Assignee:
                steve_rowe Steve Rowe
                Reporter:
                zasnuty Piotr
              • Votes:
                0 Vote for this issue
                Watchers:
                9 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: