Uploaded image for project: 'Forrest'
  1. Forrest
  2. FOR-143

Encoding of HTML files should be configurable

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Core operations
    • Labels:
      None

      Description

      From: Fabrice Bacchella


      By default, HTML files are encoded in ISO-8859-1 because of the way the Cocoon's HTML serialiser is declared in sitemap.xmap :

      <map:serializer name="html" mime-type="text/html"
      src="org.apache.cocoon.serialization.HTMLSerializer">
        <doctype-public>-//W3C//DTD HTML 4.01 Transitional//EN</doctype-public>

      <doctype-system>http://www.w3.org/TR/html4/loose.dtd&lt;/doctype-system>
      </map:serializer>

      The default one is such a case is ISO-8859-1. And it even add a META inthe HTML result. But here in French, ISO-8859-15 is better because of the =AB e dans l'o =BB character : =9C. In Russian or Asia, the
      situation is even worse.

      Is that possible to add a properties in forrest.properties saying for example :
      project.encoding=UTF-8

      and adding a <encoding>@project.encoding@</encoding> element in the
      HTML serializer ?

      And in a more general way, where can I find a document about the way properties are used by forrest ? Which files use them ? How to add a
      new one ?

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              cheche Juan Jose Pablos
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated: