Description
Parsing a very simple html like
<!DOCTYPE html>
<html lang="en">
<head>
<title>Page Title</title>
</head>
<body>
<h1 align="left">My First Heading</h1>
<p>My first paragraph.</p>
</body>
</html>
you won't be able to access the html tag's attributes (here lang="en") in the ContentHandler :
*in the method startElement(String ns, String localName, String name,
Attributes atts), atts is empty.
*Moreover it seems that the html tag's attributes are not passed trough the HtmlMapper.mapSafeAttribute method too.
Attachments
Issue Links
- links to