Description
Springs from my question here (http://stackoverflow.com/questions/42031237/modify-apache-tika-parsing-of-old-1997-2003-ms-word-docs) ... I have improved the class OpenDocumentContentParser so that it puts footnotes/endnotes at the end of the line to which they belong and doesn't break up the line in question. As with .docx parsing the notes can be linked to the reference easily. The respondee in Stack Overflow suggested I open an issue here...
Attachments
Attachments
Issue Links
- is related to
-
TIKA-2242 opendocument parsing produces malformed xml
- Resolved