Uploaded image for project: 'OpenNLP'
  1. OpenNLP
  2. OPENNLP-1479

Write better tests for pattern verification (tokenizers)

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 2.1.1
    • 2.3.2
    • Tokenizer
    • None

    Description

      From https://github.com/apache/opennlp/pull/516#issuecomment-1455015772

      At the moment our tests verify that the tokenizer objects are created correctly (i.e. tests getters and setters, constructor, etc.), without verifying the actual behavior when used in conjunction with other classes (factory, tokenizer, trainers, etc).

      It would be best to test the patterns used in the factories for different languages with some interesting sample data (maybe something from project gutenberg, open source news sites, etc.).

      Attachments

        Issue Links

          Activity

            People

              l-ma Lara Marinov
              kinow Bruno P. Kinoshita
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: