Details
-
Sub-task
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
None
-
None
Description
Several parsers can handle the same mime type, and we are currently ordering which parser is chosen (roughly) by the alphabetic order of the parser class name.
Let's allow users to configure strategies for picking parsers.
See and contribute to full discussion here: http://wiki.apache.org/tika/CompositeParserDiscussion
Attachments
Issue Links
- depends upon
-
TIKA-2084 Create resettable OutputStream to support "backoff on exception" strategy
- Open
- duplicates
-
TIKA-669 Backup plan for parsing
- Closed
-
TIKA-288 Support override parsers in AutoDetectParser
- Closed
- is related to
-
TIKA-1445 Figure out how to add Image metadata extraction to Tesseract parser
- Resolved