Details
-
Bug
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
0.1.0
-
None
-
None
Description
While the TikaHTMLParser can parse pdfs, docs, etc, it returns them in an HTMLified format. Solr blows up on that format, and it isn't always necessary to do this step anyway.
Attachments
Attachments
Issue Links
- is blocked by
-
DROIDS-71 Use latest version of Tika
- Closed