Description
The attached file was created in Google Docs with an image inside and saved as an .odt file. After saving, I opened the file with LibreOffice and added a hyperlink to the image.
When I parse the file with Tika, neither LinkContentHandler nor ToXMLContentHandler shows any trace of the hyperlink.
The link is clickable when I open the document, and it is present inside content.xml as:
<draw:a xlink:type="simple" xlink:href="http://example.test/">
I tried enabling all options in OfficeParserConfig and OOXMLParser, but the link is still not extracted.
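
For anyone who wants to confirm that the link really is in content.xml without going through Tika, here is a minimal sketch using only the JDK. It opens the .odt as a zip archive, parses content.xml with a namespace-aware DOM parser, and collects the xlink:href of every draw:a element. The class and method names (OdtDrawLinks, extractDrawLinks, extractDrawLinksFromOdt) are made up for this illustration and are not part of Tika.

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical helper, independent of Tika: lists hyperlinks attached to
// drawing objects (such as images) in an ODF content.xml.
public class OdtDrawLinks {
    // Standard ODF drawing namespace and the XLink namespace.
    static final String DRAW_NS = "urn:oasis:names:tc:opendocument:xmlns:drawing:1.0";
    static final String XLINK_NS = "http://www.w3.org/1999/xlink";

    // Returns the xlink:href value of every <draw:a> element in the stream.
    static List<String> extractDrawLinks(InputStream contentXml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true); // required for the *NS lookups below
        Document doc = factory.newDocumentBuilder().parse(contentXml);

        NodeList anchors = doc.getElementsByTagNameNS(DRAW_NS, "a");
        List<String> hrefs = new ArrayList<>();
        for (int i = 0; i < anchors.getLength(); i++) {
            Element anchor = (Element) anchors.item(i);
            hrefs.add(anchor.getAttributeNS(XLINK_NS, "href"));
        }
        return hrefs;
    }

    // Opens the .odt as a zip archive and reads its content.xml entry.
    static List<String> extractDrawLinksFromOdt(String odtPath) throws Exception {
        try (ZipFile zip = new ZipFile(odtPath)) {
            ZipEntry entry = zip.getEntry("content.xml");
            if (entry == null) {
                throw new IOException("content.xml not found in " + odtPath);
            }
            try (InputStream in = zip.getInputStream(entry)) {
                return extractDrawLinks(in);
            }
        }
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: java OdtDrawLinks <file.odt>");
            return;
        }
        for (String href : extractDrawLinksFromOdt(args[0])) {
            System.out.println(href);
        }
    }
}
```

Run against the attached file, this should list the href of the draw:a element quoted above, which would confirm that the link is stored in the document and is only missing from Tika's output.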