OpenNLP / OPENNLP-201

Sentence Detector Trainer stops reading data when it contains two empty lines


Details

    Description

      The Sentence Detector Trainer stops reading the training data when the input stream contains two or more consecutive empty lines. A single empty line is used to mark a document boundary.

      To fix this issue, the code that reads the training data should treat a run of consecutive empty lines in the same way as a single empty line.
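
The proposed behavior can be illustrated with a minimal sketch. It is not the OpenNLP implementation: the class `EmptyLineDocumentSplitter` and the method `splitDocuments` are hypothetical names, and treating whitespace-only lines as empty is an assumption. It only shows the idea that a run of empty lines closes the current document once and that reading then continues.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; not part of the OpenNLP API.
public class EmptyLineDocumentSplitter {

    /**
     * Groups training lines (one sentence per line) into documents.
     * An empty line marks a document boundary; several consecutive
     * empty lines are treated like a single one.
     */
    public static List<List<String>> splitDocuments(Iterable<String> lines) {
        List<List<String>> documents = new ArrayList<>();
        List<String> current = new ArrayList<>();

        for (String line : lines) {
            // Assumption: whitespace-only lines count as empty.
            if (line.trim().isEmpty()) {
                // Close the current document only if it has content, so
                // repeated empty lines neither create empty documents
                // nor end the read early.
                if (!current.isEmpty()) {
                    documents.add(current);
                    current = new ArrayList<>();
                }
            } else {
                current.add(line);
            }
        }

        // Flush the last document if the input does not end with an empty line.
        if (!current.isEmpty()) {
            documents.add(current);
        }
        return documents;
    }

    public static void main(String[] args) {
        List<String> lines = List.of(
                "First sentence.",
                "Second sentence.",
                "",
                "",
                "Third sentence.");
        // Prints [[First sentence., Second sentence.], [Third sentence.]]
        System.out.println(splitDocuments(lines));
    }
}
```

With this approach the sentence after the double empty line is still read and ends up in a second document, rather than being dropped.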


          People

            Jörn Kottmann (joern)
            Votes: 1
            Watchers: 0
