  1. UIMA
  2. UIMA-2524

TextMarker HTML conversion to plain text is not working correctly


Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.0TextMarker
    • Fix Version/s: 2.0.0TextMarker
    • Component/s: Ruta
    • Labels: None

    Description

      The HTMLAnnotator shipped with TextMarker can strip the HTML tags from a document and create an additional view containing the plain text. During this step, the tag information is converted to annotations whose offsets are adjusted to account for the removed tags. This functionality is not working correctly: the tags in the body of the HTML document are not removed.
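
      For illustration only, the expected behavior can be sketched in a few lines of Python. This is not the HTMLAnnotator's implementation or API; `strip_tags`, `TagAnnotation`, and the regular expression are hypothetical names introduced here, and the sketch ignores entities, comments, and script content. It shows the idea that every tag, including those inside the body, is removed from the plain text while each element becomes an annotation whose offsets point into the stripped text.

```python
import re
from dataclasses import dataclass

# Hypothetical, simplified tag pattern (assumption: no '>' inside attribute values).
TAG = re.compile(r"<(/?)([A-Za-z][A-Za-z0-9]*)[^>]*>")


@dataclass
class TagAnnotation:
    name: str   # element name, lower-cased
    begin: int  # start offset in the plain-text view
    end: int    # end offset in the plain-text view


def strip_tags(html):
    """Return (plain_text, annotations); offsets refer to plain_text."""
    pieces = []       # text chunks between tags
    length = 0        # length of the plain text emitted so far
    open_tags = []    # stack of (name, begin offset in plain text)
    annotations = []
    pos = 0
    for match in TAG.finditer(html):
        chunk = html[pos:match.start()]
        pieces.append(chunk)
        length += len(chunk)
        pos = match.end()
        is_closing = match.group(1) == "/"
        name = match.group(2).lower()
        if not is_closing:
            open_tags.append((name, length))
            continue
        # Close the nearest open tag with the same name; unclosed tags
        # nested inside it (e.g. <br>) are dropped without an annotation.
        for i in range(len(open_tags) - 1, -1, -1):
            if open_tags[i][0] == name:
                annotations.append(TagAnnotation(name, open_tags[i][1], length))
                del open_tags[i:]
                break
    pieces.append(html[pos:])
    return "".join(pieces), annotations
```

      With the input `<html><body><p>Hello <b>world</b></p></body></html>`, the plain-text view should contain no tags at all, and the `b` annotation should cover exactly the word `world` in that view.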

          People

            Assignee: pkluegl Peter Klügl
            Reporter: pkluegl Peter Klügl
