Description
Documents in the .docx format may contain smart tags (w:smartTag elements). A smart tag wraps the runs (w:r elements) that hold the tagged text.
The OOXMLParser does not extract the text contained within smart tags, so that text is missing from the parser output. [Example document to follow]
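Until the example document is attached, the following is a minimal sketch of the structure in question. It is not Tika's code: the sample paragraph, the `City` smart tag, and both helper functions are made up for illustration, and the assumption that the text is lost because only runs sitting directly under the paragraph are visited is a guess at the cause, not a confirmed diagnosis.

```python
import xml.etree.ElementTree as ET

# WordprocessingML main namespace (the "w:" prefix in document.xml).
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"

# Hypothetical paragraph: the word "Paris" is wrapped in a smart tag.
SAMPLE_PARAGRAPH = f"""
<w:p xmlns:w="{W_NS}">
  <w:r><w:t xml:space="preserve">Meet me in </w:t></w:r>
  <w:smartTag w:uri="urn:schemas-microsoft-com:office:smarttags" w:element="City">
    <w:r><w:t>Paris</w:t></w:r>
  </w:smartTag>
  <w:r><w:t xml:space="preserve"> tomorrow.</w:t></w:r>
</w:p>
"""


def text_direct_runs_only(paragraph):
    """Collect text only from w:r elements that are direct children of w:p.

    Runs nested inside w:smartTag are never visited, so their text is lost.
    """
    parts = []
    for run in paragraph.findall(f"{{{W_NS}}}r"):
        for t in run.findall(f"{{{W_NS}}}t"):
            parts.append(t.text or "")
    return "".join(parts)


def text_all_runs(paragraph):
    """Collect text from every w:t descendant, including those in smart tags."""
    return "".join(t.text or "" for t in paragraph.iter(f"{{{W_NS}}}t"))


paragraph = ET.fromstring(SAMPLE_PARAGRAPH)
print(repr(text_direct_runs_only(paragraph)))
print(repr(text_all_runs(paragraph)))
```

With the first helper the word inside the smart tag is dropped from the result; the second helper descends into w:smartTag and keeps it.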
Attachments
Issue Links
- is related to TIKA-423 Parse docx and output to text file missing words (Resolved)