Description
As simple as it gets, link and iframe tags were never implemented in LinkContentHandler. NUTCH-1233 kind of requires it.
Attachments
Attachments
Issue Links
- is depended upon by
-
NUTCH-1233 Rely on Tika for outlink extraction
- Closed
-
NUTCH-2210 Upgrade to Tika 1.12
- Closed
- is related to
-
TIKA-1937 LinkContentHandler skips script tags
- Resolved
- is required by
-
NUTCH-1233 Rely on Tika for outlink extraction
- Closed
- relates to
-
TIKA-1937 LinkContentHandler skips script tags
- Resolved
-
TIKA-503 Add a ContentHandler for collecting links from parser output
- Resolved