  1. OpenNLP
  2. OPENNLP-205

Refactor the SentenceDetectorME class to improve the mapping of end-of-sentence positions to spans


Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Sentence Detector
    • Labels: None

    Description

      The SentenceDetectorME class should be refactored to improve the mapping of end-of-sentence positions to spans. The current code tries to eliminate whitespace between two sentences, but it fails when the UseTokenEnd option is set to false. When the option is set to true, the sentence detector might not work correctly in all cases.
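
      To illustrate the kind of mapping meant here, the following is a minimal, standalone sketch and not the existing SentenceDetectorME code. It assumes the end-of-sentence positions are sorted, exclusive character offsets, and that whitespace is whatever Character.isWhitespace reports. The class and method names (SentenceSpanSketch, toSpans) are hypothetical and do not exist in OpenNLP.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not part of the OpenNLP API.
class SentenceSpanSketch {

    // Maps sorted, exclusive end-of-sentence offsets to {start, end} spans.
    // Leading and trailing whitespace is trimmed from every span, and spans
    // that become empty after trimming are dropped. Text after the last
    // end position is treated as a final sentence candidate.
    static List<int[]> toSpans(CharSequence text, int[] endPositions) {
        List<int[]> spans = new ArrayList<>();
        int start = 0;
        for (int i = 0; i <= endPositions.length; i++) {
            int end = i < endPositions.length ? endPositions[i] : text.length();
            // Clamp so out-of-range or unsorted offsets cannot break the loop.
            end = Math.max(start, Math.min(end, text.length()));

            int s = start;
            int e = end;
            while (s < e && Character.isWhitespace(text.charAt(s))) {
                s++;
            }
            while (e > s && Character.isWhitespace(text.charAt(e - 1))) {
                e--;
            }
            if (e > s) {
                spans.add(new int[] {s, e});
            }
            start = end;
        }
        return spans;
    }
}
```

      Because trimming happens in one place after the raw boundaries are known, the result does not depend on whether an end position points at the end of the last token or somewhere in the whitespace that follows it. Whether this matches the intended behaviour for both settings of UseTokenEnd is an assumption that would need to be checked against the real implementation.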

          People

            Assignee: Unassigned
            Reporter: Jörn Kottmann (joern)
            Votes: 0
            Watchers: 0

            Dates

              Created:
              Updated: