Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
-
None
-
None
Description
On https://bz.apache.org/bugzilla/show_bug.cgi?id=61354, kramachandran@commvault.com reported that our DOM parser was missing "body" sections after the first body section in docx. PJ Fanning applied the patch, and this will be available when we upgrade to POI 3.17-beta2.
As a side note, the experimental SAX parser was correctly extracting all text from the triggering document.
Attachments
Issue Links
- depends upon
-
TIKA-2429 Upgrade to POI 3.17-final when available
- Resolved