Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-9220

Upgrade Snowball version to 2.0

    XMLWordPrintableJSON

    Details

    • Type: Wish
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: main (9.0)
    • Component/s: None
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      When working with Snowball-based stemmers, I realized that Lucene is currently using a pre-compiled version of Snowball, that seems from 12 years ago: https://github.com/snowballstem/snowball/tree/e103b5c257383ee94a96e7fc58cab3c567bf079b

      Snowball has just released v2.0 in 10/2019 with many improvements, new supported languages ( Arabic, Indonesian…) and new features ( stringdef notation for Unicode codepoints…). Details of the changes could be found here: https://github.com/snowballstem/snowball/blob/master/NEWS. I think these changes of Snowball could give a promising positive impact on Lucene.

      I wonder when Lucene should upgrade Snowball to the latest version ( v2.0).

        Attachments

        1. snowball_53739a805cfa6c.patch
          35 kB
          Robert Muir
        2. snowball_53739a805cfa6c.patch
          37 kB
          Robert Muir
        3. snowball_53739a805cfa6c.patch
          38 kB
          Robert Muir

          Issue Links

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                huynmg Nguyen Minh Gia Huy
              • Votes:
                0 Vote for this issue
                Watchers:
                3 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved:

                  Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 5.5h
                  5.5h