Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-4542

Make RECURSION_CAP in HunspellStemmer configurable

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 4.0
    • 4.4, 6.0
    • modules/analysis
    • None
    • New

    Description

      Currently there is
      private static final int RECURSION_CAP = 2;
      in the code of the class HunspellStemmer. It makes using hunspell with several dictionaries almost unusable, due to bad performance (f.ex. it costs 36ms to stem long sentence in latvian for recursion_cap=2 and 5 ms for recursion_cap=1). It would be nice to be able to tune this number as needed.
      AFAIK this number (2) was chosen arbitrary.

      (it's a first issue in my life, so please forgive me any mistakes done).

      Attachments

        1. LUCENE-4542.patch
          2 kB
          Rafał Kuć
        2. LUCENE-4542-with-solr.patch
          7 kB
          Rafał Kuć
        3. Lucene-4542-javadoc.patch
          8 kB
          Rafał Kuć

        Issue Links

          Activity

            People

              sarowe Steven Rowe
              zasnuty Piotr
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: