Details
-
Improvement
-
Status: Closed
-
Minor
-
Resolution: Done
-
None
-
None
Description
Following updates:
Updated tagsoup lib to 1.2.1 which has the following modifications
- DOCTYPE is now recognized even in lower case.
- We make sure to buffer the reader, eliminating a long-standing bug that would crash on certain inputs, such as & followed by CR+LF.
- The HTML scanner's table is precompiled at run time for efficiency, causing a 4x speedup on large input documents.
- ]] within a CDATA section no longer causes input to be discarded.
- Remove bogus newline after printing children of the root element.
- Allow the noscript element anywhere, the same as the script element.
- Updated to the 2011 edition of the W3C character entity list.
Additionally:
Updated license with new home page for tagsoup
Updated annotations to OSGi annotations
Added the ability to specify additional features/properties for the parser
Documented available settings
Javadoc fixed
Prepared for different parsers by renaming HtmlParserImpl and adding component properties
Configuration improved
Attachments
Attachments
Issue Links
- relates to
-
SLING-7557 Add initial integration tests
- Closed
- links to