Uploaded image for project: 'UIMA'
  1. UIMA
  2. UIMA-3841

Ruta: greedy composed rule element ignores -contains condition

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.2.0ruta
    • Fix Version/s: 2.2.1ruta
    • Component/s: Ruta
    • Labels:
      None

      Description

      I've recently had problems with Ruta's rule inference. The sentence was like "X y, y, X, y, X" and the rule was similar to :

      S {CONTAINS(x) -> UNMARK(S)} (COMMA? S)+ {CONTAINS(y), -CONTAINS(CW) -> UNMARKALL(S), MARK(S,1,2)};
      

      where S annotations are segments, initially:
      [X y], [y], [X], [y], [X]

      It ignores the condition -CONTAINS(CW) sometimes and annotates:
      [X y, y, X, y, X]

      The problem seems to be related to the composed rule element (COMMA? S)+ and the condition action part of it.

        Attachments

          Activity

            People

            • Assignee:
              pkluegl Peter Klügl
              Reporter:
              mtoepfer Martin Toepfer
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: