Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Minor Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 5.3
    • Component/s: None
    • Lucene Fields:
      New, Patch Available

      Description

      Snowball stemmer for Lithuanian language.

      1. LUCENE-6694.patch
        79 kB
        Robert Muir
      2. LUCENE-6694.patch
        31 kB
        Dainius Jocas
      3. stem_ISO_8859_1.sbl
        13 kB
        Dainius Jocas

        Activity

        Hide
        Robert Muir added a comment -

        The patch file only contains autogenerated code. where is the snowball source?

        Show
        Robert Muir added a comment - The patch file only contains autogenerated code. where is the snowball source?
        Hide
        Dainius Jocas added a comment -

        Snowball source code can be found here.

        Also, I've made a pull request to the official snowball repository 9 days ago.

        Show
        Dainius Jocas added a comment - Snowball source code can be found here . Also, I've made a pull request to the official snowball repository 9 days ago.
        Hide
        Robert Muir added a comment -

        OK, until they incorporate it, i think we should add the .sbl file here as well so its available.

        And you can confirm its ok for us to release this under the apache 2.0 license?

        I will take a deeper look and comment again after!

        Show
        Robert Muir added a comment - OK, until they incorporate it, i think we should add the .sbl file here as well so its available. And you can confirm its ok for us to release this under the apache 2.0 license? I will take a deeper look and comment again after!
        Hide
        Dainius Jocas added a comment -

        Snowball source code.

        Show
        Dainius Jocas added a comment - Snowball source code.
        Hide
        Dainius Jocas added a comment -

        Yes, I confirm that it is OK to use Apache 2.0 license.

        Show
        Dainius Jocas added a comment - Yes, I confirm that it is OK to use Apache 2.0 license.
        Hide
        Robert Muir added a comment -

        Updated patch adding LithuanianAnalyzer, a stopwords set, and some basic tests.

        Show
        Robert Muir added a comment - Updated patch adding LithuanianAnalyzer, a stopwords set, and some basic tests.
        Hide
        Robert Muir added a comment -

        Thank for the contribution here! We really need support for this language and I like the stemmer. It seems to work well with nouns and adjectives and does not seem to suffer from overstemming issues.

        If you have a chance, please have a look at the latest patch. I will look into this more tomorrow.

        Show
        Robert Muir added a comment - Thank for the contribution here! We really need support for this language and I like the stemmer. It seems to work well with nouns and adjectives and does not seem to suffer from overstemming issues. If you have a chance, please have a look at the latest patch. I will look into this more tomorrow.
        Hide
        Dainius Jocas added a comment -

        Thanks! When designing the stemmer, I've put emphasis on improving noun stemming because majority of complaints from our clients were mistakes in noun stemming. Of course, Lithuanian language is complicated and there are enough space for improvements.

        I've checked the latest patch and found no problems.

        Show
        Dainius Jocas added a comment - Thanks! When designing the stemmer, I've put emphasis on improving noun stemming because majority of complaints from our clients were mistakes in noun stemming. Of course, Lithuanian language is complicated and there are enough space for improvements. I've checked the latest patch and found no problems.
        Hide
        Robert Muir added a comment -

        Thanks for checking! I plan to commit this later today.

        Show
        Robert Muir added a comment - Thanks for checking! I plan to commit this later today.
        Hide
        ASF subversion and git services added a comment -

        Commit 1692544 from Robert Muir in branch 'dev/trunk'
        [ https://svn.apache.org/r1692544 ]

        LUCENE-6694: Add LithuanianAnalyzer and LithuanianStemmer

        Show
        ASF subversion and git services added a comment - Commit 1692544 from Robert Muir in branch 'dev/trunk' [ https://svn.apache.org/r1692544 ] LUCENE-6694 : Add LithuanianAnalyzer and LithuanianStemmer
        Hide
        ASF subversion and git services added a comment -

        Commit 1692547 from Robert Muir in branch 'dev/branches/branch_5x'
        [ https://svn.apache.org/r1692547 ]

        LUCENE-6694: Add LithuanianAnalyzer and LithuanianStemmer

        Show
        ASF subversion and git services added a comment - Commit 1692547 from Robert Muir in branch 'dev/branches/branch_5x' [ https://svn.apache.org/r1692547 ] LUCENE-6694 : Add LithuanianAnalyzer and LithuanianStemmer
        Hide
        Robert Muir added a comment -

        Thanks Dainius!

        Show
        Robert Muir added a comment - Thanks Dainius!
        Hide
        Shalin Shekhar Mangar added a comment -

        Bulk close for 5.3.0 release

        Show
        Shalin Shekhar Mangar added a comment - Bulk close for 5.3.0 release

          People

          • Assignee:
            Unassigned
            Reporter:
            Dainius Jocas
          • Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development