Details
-
Improvement
-
Status: Open
-
Major
-
Resolution: Unresolved
-
1.17
-
None
Description
In tika-parsers/pdf/ would it be possible to make PDF2XHTML and AbstractPDF2XHTML public so they can be inherited. We would like to capture some additional font and layout information when outputting XHTML. We would like to inherit PDF2XHTML and override some of the functions to do what we need.