Lucene - Core
  LUCENE-2055

Fix buggy stemmers and remove duplicate analysis functionality

    Details

    • Lucene Fields:
      New

      Description

      I would like to remove the custom stemmers in the following packages and have their analyzers use a SnowballStemFilter instead.

      • analyzers/fr
      • analyzers/nl
      • analyzers/ru

      Below are excerpts from the existing stemmers in these packages, where they proudly proclaim that they use the snowball algorithm.
      I think we should delete all of this custom stemming code in favor of the actual snowball package.

      /**
       * A stemmer for French words. 
       * <p>
       * The algorithm is based on the work of
       * Dr Martin Porter on his snowball project<br>
       * refer to http://snowball.sourceforge.net/french/stemmer.html<br>
       * (French stemming algorithm) for details
       * </p>
       */
      
      public class FrenchStemmer {
      
      /**
       * A stemmer for Dutch words. 
       * <p>
       * The algorithm is an implementation of
       * the <a href="http://snowball.tartarus.org/algorithms/dutch/stemmer.html">dutch stemming</a>
       * algorithm in Martin Porter's snowball project.
       * </p>
       */
      public class DutchStemmer {
      
      /**
       * Russian stemming algorithm implementation (see http://snowball.sourceforge.net for detailed description).
       */
      class RussianStemmer
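
      As a rough illustration of the proposed replacement, the sketch below maps each affected analyzer package to the Snowball stemmer that would take over from its custom class. The class SnowballNames and its method forPackage are hypothetical names made up for this example; they are not part of Lucene or of the attached patches. The stemmer names "French", "Dutch" and "Russian" are the language names the Snowball project uses for these algorithms, and the sketch assumes the Snowball-based filter is selected by such a name.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/**
 * Hypothetical helper for illustration only (not Lucene API, not from the patch).
 * It records which Snowball stemmer would replace the custom stemmer in each package.
 */
class SnowballNames {
    // Analyzer package suffix -> Snowball stemmer language name (assumed naming).
    private static final Map<String, String> NAMES = new LinkedHashMap<>();
    static {
        NAMES.put("fr", "French");   // would replace FrenchStemmer
        NAMES.put("nl", "Dutch");    // would replace DutchStemmer
        NAMES.put("ru", "Russian");  // would replace RussianStemmer
    }

    /** Returns the Snowball stemmer name for an analyzer package such as "fr". */
    static String forPackage(String pkg) {
        String name = NAMES.get(pkg);
        if (name == null) {
            // Packages outside this issue are deliberately not guessed at.
            throw new IllegalArgumentException("No Snowball mapping for package: " + pkg);
        }
        return name;
    }
}
```

      With a mapping like this, each analyzer would hand the returned name to the Snowball-based filter in its token stream instead of instantiating its own stemmer class, so only one implementation of each algorithm remains to be maintained.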
      
      1. LUCENE-2055.patch (179 kB, Robert Muir)
      2. LUCENE-2055.patch (168 kB, Robert Muir)
      3. LUCENE-2055.patch (169 kB, Robert Muir)
      4. LUCENE-2055.patch (163 kB, Robert Muir)
      5. LUCENE-2055.patch (163 kB, Robert Muir)


            People

            • Assignee: Robert Muir
            • Reporter: Robert Muir