Uploaded image for project: 'OpenNLP'
  1. OpenNLP
  2. OPENNLP-1523

Use the snowball-data set to write language-specific stemmer eval tests

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 2.3.1
    • 2.3.2
    • Stemmer
    • None

    Description

      Investigate on the possibility to re-use https://github.com/snowballstem/snowball-data/tree/master in our eval data to run it against our stemmers to see how good they behave for certain languages.

      It contains of two files "vocab" (to be stemmed) and "output" (expected)

      Attachments

        Activity

          People

            rzo1 Richard Zowalla
            rzo1 Richard Zowalla
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: