Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-9220

Upgrade Snowball version to 2.0

Details

    • Wish
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 9.0
    • None
    • None
    • New

    Description

      When working with Snowball-based stemmers, I realized that Lucene is currently using a pre-compiled version of Snowball, that seems from 12 years ago: https://github.com/snowballstem/snowball/tree/e103b5c257383ee94a96e7fc58cab3c567bf079b

      Snowball has just released v2.0 in 10/2019 with many improvements, new supported languages ( Arabic, Indonesian…) and new features ( stringdef notation for Unicode codepoints…). Details of the changes could be found here: https://github.com/snowballstem/snowball/blob/master/NEWS. I think these changes of Snowball could give a promising positive impact on Lucene.

      I wonder when Lucene should upgrade Snowball to the latest version ( v2.0).

      Attachments

        1. snowball_53739a805cfa6c.patch
          38 kB
          Robert Muir
        2. snowball_53739a805cfa6c.patch
          37 kB
          Robert Muir
        3. snowball_53739a805cfa6c.patch
          35 kB
          Robert Muir

        Issue Links

          Activity

            People

              Unassigned Unassigned
              huynmg Nguyen Minh Gia Huy
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 5.5h
                  5.5h