Details
-
Improvement
-
Status: Closed
-
Major
-
Resolution: Won't Fix
-
1.15
-
None
-
None
Description
There are cases when legacy parsers successfully parse documents on which Tika fails. I am attaching a list of examples of such documents. Nutch allows use of more than one parser on a document, in a sequence, until the document has been parsed successfully. Thus, old parsers can be combined with Tika to achieve better parsing success rate, at least until Tika is perfect.