Uploaded image for project: 'UIMA'
  1. UIMA
  2. UIMA-3841

Ruta: greedy composed rule element ignores -contains condition

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 2.2.0ruta
    • 2.2.1ruta
    • Ruta
    • None

    Description

      I've recently had problems with Ruta's rule inference. The sentence was like "X y, y, X, y, X" and the rule was similar to :

      S {CONTAINS(x) -> UNMARK(S)} (COMMA? S)+ {CONTAINS(y), -CONTAINS(CW) -> UNMARKALL(S), MARK(S,1,2)};
      

      where S annotations are segments, initially:
      [X y], [y], [X], [y], [X]

      It ignores the condition -CONTAINS(CW) sometimes and annotates:
      [X y, y, X, y, X]

      The problem seems to be related to the composed rule element (COMMA? S)+ and the condition action part of it.

      Attachments

        Activity

          People

            pkluegl Peter Klügl
            mtoepfer Martin Toepfer
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: