Uploaded image for project: 'Maven Doxia'
  1. Maven Doxia
  2. DOXIA-278

Character encoding autodetection fails for APT source files

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Not A Problem
    • 1.0-alpha-11
    • None
    • Module - Apt
    • None
    • Mac OS X 10.5.6, Java 1.6.0_07

    Description

      Doxia unnecessarily forces all APT source files to be encoded in ISO-8859-1. Files encoded in UTF-8 can have garbage characters as a result. Doxia should be able to autodetect the encoding of the APT file to prevent this problem, as it already does for XML (see DOXIA-133).

      A test case is attached. It includes two APT source files, one encoded in ISO-8859-1 and another encoded in UTF-8. Both contain the copyright symbol. To reproduce the problem, simply run "mvn site" on the project and open the target/site/test-utf8.html and target/site/test-iso-8859-1.html. The file encoded with ISO-8859-1 should display the copyright symbol correctly, while the one encoded with UTF-8 contains a garbage character immediately before the symbol.

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            hboutemy Herve Boutemy
            vocaro Trevor Harmon
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment