Uploaded image for project: 'Maven Doxia'
  1. Maven Doxia
  2. DOXIA-278

Character encoding autodetection fails for APT source files

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Not A Problem
    • 1.0-alpha-11
    • None
    • Module - Apt
    • None
    • Mac OS X 10.5.6, Java 1.6.0_07

    Description

      Doxia unnecessarily forces all APT source files to be encoded in ISO-8859-1. Files encoded in UTF-8 can have garbage characters as a result. Doxia should be able to autodetect the encoding of the APT file to prevent this problem, as it already does for XML (see DOXIA-133).

      A test case is attached. It includes two APT source files, one encoded in ISO-8859-1 and another encoded in UTF-8. Both contain the copyright symbol. To reproduce the problem, simply run "mvn site" on the project and open the target/site/test-utf8.html and target/site/test-iso-8859-1.html. The file encoded with ISO-8859-1 should display the copyright symbol correctly, while the one encoded with UTF-8 contains a garbage character immediately before the symbol.

      Attachments

        1. HelloWorld.zip
          3 kB
          Trevor Harmon

        Issue Links

          Activity

            People

              hboutemy Herve Boutemy
              vocaro Trevor Harmon
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: