Uploaded image for project: 'Lucene.Net'
  1. Lucene.Net
  2. LUCENENET-551

Latin language Stemmer (feature request)

    XMLWordPrintableJSON

    Details

      Description

      I would find a Latin language stemmer very helpful. The Schinke Latin stemming algorithm has been converted to Snowball here: http://snowball.tartarus.org/otherapps/schinke/intro.html . I have not worked out how to compile Snowball into .cs to try it.

      There are currently 5 romance-languages supported (French, Spanish, Portuguese, Italian, Romanian). so if the above doesn't work, I imagine one of these could be modified to support Latin.

      I realise SF.Snowball is considered a contrib package rather than core, but Lucene.Net seems to be the main place where Snowball stemmers are provided and maintained for C# / .Net.

      Note, other language ports of Snowball support Latin (using the Schinke contribution), such as Ruby: https://github.com/aurelian/ruby-stemmer

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              pengo Peter Halasz
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated: