Details
- Improvement
- Status: Resolved
- Minor
- Resolution: Fixed
- 1.2
- None
Description
In org.apache.any23.extractor.html.EmbeddedJSONLDExtractor.extractJSONLDScript(), I think this line:
List<Node> scriptNodes = DomUtils.findAll(in, "/HTML/HEAD/SCRIPT");
is too restrictive. Scripts containing JSON-LD can be placed anywhere in the page, and some CMS/WordPress plugins that insert JSON-LD actually generate their output in the body, not in the head.
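To illustrate the difference, here is a minimal sketch that compares the head-only path with a document-wide lookup. It uses the JDK's own javax.xml.xpath API rather than Any23's DomUtils, so it is not the project's actual fix. The class name JsonLdScriptLookup, the two constants, and the helper methods are hypothetical, and the upper-case element names are assumed from the XPath quoted above.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper, not part of Any23: compares two XPath lookups on a DOM.
class JsonLdScriptLookup {

    // The path quoted in this report: matches scripts directly under HEAD only.
    static final String HEAD_ONLY = "/HTML/HEAD/SCRIPT";

    // Assumed alternative: any JSON-LD script element, wherever it sits.
    static final String ANYWHERE = "//SCRIPT[@type='application/ld+json']";

    // Evaluates an XPath expression against a node and returns the matches.
    static List<Node> findAll(Node root, String xpath) {
        try {
            NodeList hits = (NodeList) XPathFactory.newInstance().newXPath()
                    .evaluate(xpath, root, XPathConstants.NODESET);
            List<Node> out = new ArrayList<>();
            for (int i = 0; i < hits.getLength(); i++) {
                out.add(hits.item(i));
            }
            return out;
        } catch (Exception e) {
            throw new IllegalStateException("XPath evaluation failed: " + xpath, e);
        }
    }

    // Parses well-formed markup into a DOM (stand-in for a real HTML parser).
    static Document parse(String markup) {
        try {
            return DocumentBuilderFactory.newInstance().newDocumentBuilder()
                    .parse(new InputSource(new StringReader(markup)));
        } catch (Exception e) {
            throw new IllegalStateException("Could not parse markup", e);
        }
    }
}
```

For a page whose only JSON-LD script sits inside BODY, findAll(doc, JsonLdScriptLookup.HEAD_ONLY) returns an empty list, while findAll(doc, JsonLdScriptLookup.ANYWHERE) returns that script node. Filtering on the type attribute keeps ordinary JavaScript blocks out of the result.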
Attachments