  1. UIMA
  2. UIMA-2508

Improve lexer in default TextMarker seeding for html fragments

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 2.0.0TextMarker
    • Fix Version/s: 2.0.0TextMarker
    • Component/s: Ruta
    • Labels: None

    Description

      The default seeding erroneously creates markup annotations because the regular expression applied in the lexer is too simple. The pattern that identifies markup should be based on something like: \<\/?\w+(([ \t\f]\w([ \t\f]=[ \t\f](\".?\"|\'.?\'|[^\'\"> \t\f]))?)[ \t\f]|[ \t\f])\/?\>
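
To illustrate the problem described above, here is a minimal Python sketch that compares a deliberately simple tag pattern with a stricter one modeled on the expression in the description. Both patterns are assumptions made for illustration: NAIVE_TAG is a hypothetical stand-in for the "too simple" lexer rule, and the quantifiers in STRICT_TAG are chosen here and may differ from the expression actually used in the fix. The names NAIVE_TAG, STRICT_TAG, and find_markup are not part of TextMarker or Ruta.

```python
import re

# Hypothetical "too simple" rule: anything between '<' and the next '>'.
NAIVE_TAG = re.compile(r"<[^>]*>")

# Building blocks for a stricter rule (quantifiers are assumptions).
WS = r"[ \t\f]"
# An attribute value: double-quoted, single-quoted, or unquoted.
ATTR_VALUE = r"""(?:"[^"]*"|'[^']*'|[^'"> \t\f]+)"""
# An attribute: whitespace, a name, and an optional "= value" part.
ATTRIBUTE = rf"{WS}+\w+(?:{WS}*={WS}*{ATTR_VALUE})?"
# A tag: '<', optional '/', a name, attributes or whitespace, optional '/', '>'.
STRICT_TAG = re.compile(rf"</?\w+(?:(?:{ATTRIBUTE})+{WS}*|{WS}*)/?>")


def find_markup(text: str, pattern: re.Pattern) -> list[str]:
    """Return every substring of text that the pattern treats as markup."""
    return pattern.findall(text)
```

On input such as 'if a < b and c > d then <b class="x">bold</b>', the simple pattern also reports '< b and c >' as markup, while the stricter pattern requires a tag name directly after '<' and reports only the two real tags.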

          People

            Assignee: Peter Klügl (pkluegl)
            Reporter: Peter Klügl (pkluegl)
            Votes: 0
            Watchers: 1

            Dates

              Created:
              Updated:
              Resolved: