Cocoon 3
  COCOON3-5

Add an HTML2XHTML converter as Starter

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 3.0.0-alpha-2
    • Fix Version/s: 3.0.0-alpha-2
    • Component/s: cocoon-optional
    • Labels:
      None

      Description

      This starter component for the pipeline fetches the HTML content at a specified URL and transforms it into XHTML or, at least, into a well-formed XML document.
      The original document can then be processed in the pipeline in various ways:
       * following links;
       * implementing crawlers;
       * easily transforming the original document into various other formats;
       * etc.
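To illustrate the first item, here is a minimal sketch of link extraction once the markup is well-formed. It is not part of the attached patch: the class name `LinkCollector` and the sample markup are hypothetical, and it uses the JDK's own SAX parser on input that is assumed to be well-formed XHTML already.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper: collects the href of every <a> element from well-formed XHTML.
public class LinkCollector extends DefaultHandler {
    private final List<String> links = new ArrayList<>();

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        // The default factory is not namespace-aware, so the element name arrives in qName.
        if ("a".equalsIgnoreCase(qName)) {
            String href = atts.getValue("href");
            if (href != null) {
                links.add(href);
            }
        }
    }

    // Parses the given XHTML string and returns the links in document order.
    public static List<String> collect(String xhtml) throws Exception {
        LinkCollector handler = new LinkCollector();
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xhtml)), handler);
        return handler.links;
    }

    public static void main(String[] args) throws Exception {
        String xhtml = "<html><body><a href=\"a.html\">A</a>"
                + "<p><a href=\"b.html\">B</a></p></body></html>";
        System.out.println(collect(xhtml));
    }
}
```

A crawler could feed each collected link back into the same starter; resolving relative URLs is left out of this sketch.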

      I want to explain the need for this component with a use case. Last week I had to face a peculiar problem: implementing a simple service that takes the URL of an HTML page as input and transforms the page, through the Optimus XSLT (http://microformatique.com/optimus - http://code.google.com/p/mf-optimus/source/browse/#svn/trunk/xsl), into an XML document that contains the original document's Microformats in an easier, more parsable format.
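The XSLT step of such a service can be sketched with the JDK's `javax.xml.transform` API. The stylesheet below is a tiny hypothetical stand-in, not Optimus: it only lists the text of elements carrying `class="fn"`. The class name `MicroformatExtractor` and the sample data are likewise assumptions for illustration.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

public class MicroformatExtractor {
    // Hypothetical stand-in stylesheet (NOT the Optimus XSLT):
    // emits one <name> element per element whose class attribute is exactly "fn".
    static final String XSLT =
          "<xsl:stylesheet version='1.0' xmlns:xsl='http://www.w3.org/1999/XSL/Transform'>"
        + "<xsl:output method='xml' omit-xml-declaration='yes'/>"
        + "<xsl:template match='/'>"
        + "<names><xsl:for-each select=\"//*[@class='fn']\">"
        + "<name><xsl:value-of select='.'/></name>"
        + "</xsl:for-each></names>"
        + "</xsl:template>"
        + "</xsl:stylesheet>";

    // Applies the stylesheet to well-formed XHTML and returns the resulting XML as a string.
    public static String extract(String xhtml) throws Exception {
        Transformer transformer = TransformerFactory.newInstance()
                .newTransformer(new StreamSource(new StringReader(XSLT)));
        StringWriter out = new StringWriter();
        transformer.transform(new StreamSource(new StringReader(xhtml)), new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        String xhtml = "<html><body><div class='vcard'>"
                + "<span class='fn'>Jane Doe</span></div></body></html>";
        System.out.println(extract(xhtml));
    }
}
```

The transformation only works because its input is well-formed, which is the reason the HTML has to be converted first.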

        Activity

        Reinhard Poetz added a comment -
        Thanks Simone! We applied your patch with some minor modifications so that it runs with sitemaps too.
        Simone Tripodi added a comment -
        The attached patch contains a simple implementation that uses CyberNeko (http://nekohtml.sourceforge.net/).
        Like other generators, it works with the SAX APIs: it starts from a SAXParser instance and notifies SAX events to the xmlConsumer.
        A simple test case has also been implemented.
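The wiring described in this comment (a SAX parser pushing events to a consumer) might look roughly like the sketch below. It is not the attached patch: the class `SaxStarterSketch` and its methods are hypothetical, and the JDK's XML parser stands in for the CyberNeko parser so that the example runs without extra dependencies. As a result it accepts only well-formed input; tolerating real-world HTML would require an HTML-aware `XMLReader` instead.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.parsers.SAXParserFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.sax.SAXTransformerFactory;
import javax.xml.transform.sax.TransformerHandler;
import javax.xml.transform.stream.StreamResult;
import org.xml.sax.ContentHandler;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;

public class SaxStarterSketch {
    // Parses the source with the given reader and pushes every SAX event to the consumer.
    public static void generate(XMLReader reader, InputSource source, ContentHandler consumer)
            throws Exception {
        reader.setContentHandler(consumer);
        reader.parse(source);
    }

    // Demonstration: the consumer is a serializer that writes the received events back as XML.
    public static String roundTrip(String markup) throws Exception {
        // Stand-in reader (assumption): the JDK XML parser, not an HTML-tolerant one.
        SAXParserFactory parserFactory = SAXParserFactory.newInstance();
        parserFactory.setNamespaceAware(true);
        XMLReader reader = parserFactory.newSAXParser().getXMLReader();

        SAXTransformerFactory factory = (SAXTransformerFactory) TransformerFactory.newInstance();
        TransformerHandler serializer = factory.newTransformerHandler();
        serializer.getTransformer().setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
        StringWriter out = new StringWriter();
        serializer.setResult(new StreamResult(out));

        generate(reader, new InputSource(new StringReader(markup)), serializer);
        return out.toString();
    }

    public static void main(String[] args) throws Exception {
        System.out.println(roundTrip("<p>hi <b>there</b></p>"));
    }
}
```

Because `generate` depends only on the `XMLReader` and `ContentHandler` interfaces, the parser can be swapped without touching the consumer side.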

          People

          • Assignee:
            Unassigned
            Reporter:
            Simone Tripodi
          • Votes:
            0
            Watchers:
            0

            Dates

            • Created:
              Updated:
              Resolved: