cTAKES / CTAKES-508

HTMLTextWriter encodes some characters incorrectly in Firefox

Details

    • Type: Bug
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: 4.0.0
    • Fix Version/s: None
    • Component/s: ctakes-core

    Description

      Some documents written by the HTML pretty text writer (org.apache.ctakes.core.cc.pretty.html.HtmlTextWriter) render in Firefox 59.0.3 with an A-with-circumflex character (Â) wherever a non-breaking space (&nbsp;) should appear, as described (in an unrelated context) in the StackOverflow post below:

      https://stackoverflow.com/questions/1461907

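      This seems to be because the writer implicitly emits UTF-8 but the HTML file does not declare an encoding. With no declaration, Firefox falls back to a Western encoding rather than UTF-8, so the two UTF-8 bytes of a non-breaking space (0xC2 0xA0) are displayed as Â followed by a non-breaking space.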
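
      The mismatch can be reproduced outside the browser. The sketch below is illustrative only and is not cTAKES code: the class and method names are made up, and ISO-8859-1 stands in for Firefox's "Western" default (it agrees with windows-1252 on the two bytes involved). It encodes a non-breaking space as UTF-8 and decodes the bytes with the wrong charset.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; not part of cTAKES.
class MojibakeDemo {
    // Encode text as UTF-8, then decode the bytes as ISO-8859-1,
    // mimicking a browser that assumes a Western encoding.
    static String decodeUtf8AsWestern(String text) {
        byte[] utf8Bytes = text.getBytes(StandardCharsets.UTF_8);
        return new String(utf8Bytes, StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        String original = "a\u00A0b"; // 'a', NO-BREAK SPACE, 'b'
        String misread = decodeUtf8AsWestern(original);
        // U+00A0 is written as the bytes 0xC2 0xA0, which a Western decoder
        // reads as U+00C2 (A with circumflex) followed by U+00A0.
        System.out.println(misread.equals("a\u00C2\u00A0b")); // prints "true"
    }
}
```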

      This could be solved by adding a META tag that specifies UTF-8 encoding, for example <meta charset="UTF-8">, to the head of the HTML documents that are written.
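
      A minimal sketch of the proposed fix, under stated assumptions: Utf8HtmlSketch, buildPage and writePage are hypothetical names and do not reflect how HtmlTextWriter is actually structured. The point is only that the declared encoding and the encoding used to write the bytes should both be UTF-8. The title and body are inserted without escaping, so real code would need to handle that.

```java
import java.io.IOException;
import java.io.Writer;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper class; not part of cTAKES.
class Utf8HtmlSketch {
    // Wrap body markup in a page whose head declares UTF-8.
    // The meta tag comes first in the head so the browser sees it early.
    static String buildPage(String title, String bodyHtml) {
        return "<!DOCTYPE html>\n"
            + "<html>\n<head>\n"
            + "<meta charset=\"UTF-8\">\n"
            + "<title>" + title + "</title>\n"
            + "</head>\n<body>\n"
            + bodyHtml + "\n"
            + "</body>\n</html>\n";
    }

    // Write the page with an explicit UTF-8 writer so the bytes on disk
    // match the declaration, regardless of the platform default charset.
    static void writePage(Path file, String title, String bodyHtml) throws IOException {
        try (Writer writer = Files.newBufferedWriter(file, StandardCharsets.UTF_8)) {
            writer.write(buildPage(title, bodyHtml));
        }
    }

    public static void main(String[] args) throws IOException {
        Path out = Files.createTempFile("note", ".html");
        writePage(out, "Note", "word\u00A0word");
        System.out.println(out); // path of the generated file
    }
}
```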

          People

            Assignee: Unassigned
            Reporter: Sean Mullane (smullane)