Description
The attached Word file opens fine in Word, but the Tika parser throws the following exception when parsing it:
java.lang.NullPointerException
at org.apache.poi.xwpf.usermodel.XWPFSDTContentCell.<init>(XWPFSDTContentCell.java:49)
at org.apache.poi.xwpf.usermodel.XWPFSDTCell.<init>(XWPFSDTCell.java:35)
at org.apache.poi.xwpf.usermodel.XWPFTableRow.getTableICells(XWPFTableRow.java:147)
at org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator.extractTable(XWPFWordExtractorDecorator.java:359)
at org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator.extractIBodyText(XWPFWordExtractorDecorator.java:111)
at org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator.buildXHTML(XWPFWordExtractorDecorator.java:93)
at org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor.getXHTML(AbstractOOXMLExtractor.java:109)
at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:112)
at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:87)
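Reading the trace from the top down, the exception appears to be thrown inside POI (the XWPFSDTContentCell constructor), and the innermost Tika frame is XWPFWordExtractorDecorator.extractTable. The following is a minimal sketch of that reading, not part of Tika or POI: the class StackTraceOrigin and its methods are hypothetical helpers, and the frames are copied from the trace above with the leading "at " removed.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Optional;

// Hypothetical helper for reading the trace; not part of Tika or POI.
class StackTraceOrigin {

    // Frames copied from the report, innermost first.
    static final List<String> FRAMES = Arrays.asList(
        "org.apache.poi.xwpf.usermodel.XWPFSDTContentCell.<init>(XWPFSDTContentCell.java:49)",
        "org.apache.poi.xwpf.usermodel.XWPFSDTCell.<init>(XWPFSDTCell.java:35)",
        "org.apache.poi.xwpf.usermodel.XWPFTableRow.getTableICells(XWPFTableRow.java:147)",
        "org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator.extractTable(XWPFWordExtractorDecorator.java:359)",
        "org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator.extractIBodyText(XWPFWordExtractorDecorator.java:111)",
        "org.apache.tika.parser.microsoft.ooxml.XWPFWordExtractorDecorator.buildXHTML(XWPFWordExtractorDecorator.java:93)",
        "org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor.getXHTML(AbstractOOXMLExtractor.java:109)",
        "org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:112)",
        "org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:87)"
    );

    // The first frame is the innermost one, i.e. where the exception was thrown.
    static String throwSite(List<String> frames) {
        return frames.get(0);
    }

    // Returns the innermost frame whose class name starts with the given package prefix.
    static Optional<String> firstFrameIn(List<String> frames, String packagePrefix) {
        return frames.stream()
                .filter(frame -> frame.startsWith(packagePrefix))
                .findFirst();
    }

    public static void main(String[] args) {
        System.out.println("Thrown in:            " + throwSite(FRAMES));
        System.out.println("Innermost Tika frame: "
                + firstFrameIn(FRAMES, "org.apache.tika.").orElse("(none)"));
    }
}
```

Because the throw site is in org.apache.poi rather than org.apache.tika, a fix on the POI side is the plausible route, which is consistent with this issue depending on a POI upgrade.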
Attachments
Issue Links
- depends upon TIKA-2116 Upgrade to POI 3.16-beta1 when available (Resolved)