Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 5.3
    • Component/s: None
    • Lucene Fields:
      New, Patch Available

      Description

      Snowball stemmer for Lithuanian language.

      1. LUCENE-6694.patch
        31 kB
        Dainius Jocas
      2. stem_ISO_8859_1.sbl
        13 kB
        Dainius Jocas
      3. LUCENE-6694.patch
        79 kB
        Robert Muir

        Activity

        Hide
        rcmuir Robert Muir added a comment -

        The patch file only contains autogenerated code. where is the snowball source?

        Show
        rcmuir Robert Muir added a comment - The patch file only contains autogenerated code. where is the snowball source?
        Hide
        djocas Dainius Jocas added a comment -

        Snowball source code can be found here.

        Also, I've made a pull request to the official snowball repository 9 days ago.

        Show
        djocas Dainius Jocas added a comment - Snowball source code can be found here . Also, I've made a pull request to the official snowball repository 9 days ago.
        Hide
        rcmuir Robert Muir added a comment -

        OK, until they incorporate it, i think we should add the .sbl file here as well so its available.

        And you can confirm its ok for us to release this under the apache 2.0 license?

        I will take a deeper look and comment again after!

        Show
        rcmuir Robert Muir added a comment - OK, until they incorporate it, i think we should add the .sbl file here as well so its available. And you can confirm its ok for us to release this under the apache 2.0 license? I will take a deeper look and comment again after!
        Hide
        djocas Dainius Jocas added a comment -

        Snowball source code.

        Show
        djocas Dainius Jocas added a comment - Snowball source code.
        Hide
        djocas Dainius Jocas added a comment -

        Yes, I confirm that it is OK to use Apache 2.0 license.

        Show
        djocas Dainius Jocas added a comment - Yes, I confirm that it is OK to use Apache 2.0 license.
        Hide
        rcmuir Robert Muir added a comment -

        Updated patch adding LithuanianAnalyzer, a stopwords set, and some basic tests.

        Show
        rcmuir Robert Muir added a comment - Updated patch adding LithuanianAnalyzer, a stopwords set, and some basic tests.
        Hide
        rcmuir Robert Muir added a comment -

        Thank for the contribution here! We really need support for this language and I like the stemmer. It seems to work well with nouns and adjectives and does not seem to suffer from overstemming issues.

        If you have a chance, please have a look at the latest patch. I will look into this more tomorrow.

        Show
        rcmuir Robert Muir added a comment - Thank for the contribution here! We really need support for this language and I like the stemmer. It seems to work well with nouns and adjectives and does not seem to suffer from overstemming issues. If you have a chance, please have a look at the latest patch. I will look into this more tomorrow.
        Hide
        djocas Dainius Jocas added a comment -

        Thanks! When designing the stemmer, I've put emphasis on improving noun stemming because majority of complaints from our clients were mistakes in noun stemming. Of course, Lithuanian language is complicated and there are enough space for improvements.

        I've checked the latest patch and found no problems.

        Show
        djocas Dainius Jocas added a comment - Thanks! When designing the stemmer, I've put emphasis on improving noun stemming because majority of complaints from our clients were mistakes in noun stemming. Of course, Lithuanian language is complicated and there are enough space for improvements. I've checked the latest patch and found no problems.
        Hide
        rcmuir Robert Muir added a comment -

        Thanks for checking! I plan to commit this later today.

        Show
        rcmuir Robert Muir added a comment - Thanks for checking! I plan to commit this later today.
        Hide
        jira-bot ASF subversion and git services added a comment -

        Commit 1692544 from Robert Muir in branch 'dev/trunk'
        [ https://svn.apache.org/r1692544 ]

        LUCENE-6694: Add LithuanianAnalyzer and LithuanianStemmer

        Show
        jira-bot ASF subversion and git services added a comment - Commit 1692544 from Robert Muir in branch 'dev/trunk' [ https://svn.apache.org/r1692544 ] LUCENE-6694 : Add LithuanianAnalyzer and LithuanianStemmer
        Hide
        jira-bot ASF subversion and git services added a comment -

        Commit 1692547 from Robert Muir in branch 'dev/branches/branch_5x'
        [ https://svn.apache.org/r1692547 ]

        LUCENE-6694: Add LithuanianAnalyzer and LithuanianStemmer

        Show
        jira-bot ASF subversion and git services added a comment - Commit 1692547 from Robert Muir in branch 'dev/branches/branch_5x' [ https://svn.apache.org/r1692547 ] LUCENE-6694 : Add LithuanianAnalyzer and LithuanianStemmer
        Hide
        rcmuir Robert Muir added a comment -

        Thanks Dainius!

        Show
        rcmuir Robert Muir added a comment - Thanks Dainius!
        Hide
        shalinmangar Shalin Shekhar Mangar added a comment -

        Bulk close for 5.3.0 release

        Show
        shalinmangar Shalin Shekhar Mangar added a comment - Bulk close for 5.3.0 release

          People

          • Assignee:
            Unassigned
            Reporter:
            djocas Dainius Jocas
          • Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development