Uploaded image for project: 'OpenNLP'
  1. OpenNLP
  2. OPENNLP-676

POSTagger UIMA AE broken because of AnnotationComboIterator

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: tools-1.5.3
    • Fix Version/s: 1.6.0
    • Labels:
      None
    • Environment:
      Oracle JDK8, Debian Jessie 64b

      Description

      The AnnotationComboIterator helper class used by the UIMA POSTagger accesses its iterators unsafely.

      The consequence is that the AE breaks even on very simple CASes such as the CAS showcased on this repository (text of 9 letters, 2 sentence annotations and 9 token annotations): https://github.com/m09/postagger-iterator-bug/blob/master/in.xmi

      The repository linked above contains an example program that crashes on my setup. It's fully maven 3 aware so you can normally launch it quite easily.

      Here is a patch that should address the issue: https://raw.githubusercontent.com/m09/postagger-iterator-bug/master/iterator.patch

        Attachments

          Activity

            People

            • Assignee:
              joern Jörn Kottmann
              Reporter:
              m09 Hugo Mougard
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: