Details
Type: Bug
Status: Closed
Priority: Minor
Resolution: Workaround
Affects Version/s: 1.19.1
None
None
Description
I am running a Tomcat-based server for attachment extraction using the Apache Tika app jar, version 1.19.1 (https://www.apache.org/dyn/closer.cgi/tika/tika-app-1.19.1.jar). When extracting docx files (only some files, not all of them), Tika throws the following exception. The JDK version is 1.8.0_162.
java.lang.NoClassDefFoundError: Could not initialize class org.openxmlformats.schemas.wordprocessingml.x2006.main.DocumentDocument
    at org.openxmlformats.schemas.wordprocessingml.x2006.main.DocumentDocument$Factory.parse(Unknown Source)
    at org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:178)
    at org.apache.poi.ooxml.POIXMLDocument.load(POIXMLDocument.java:184)
    at org.apache.poi.xwpf.usermodel.XWPFDocument.<init>(XWPFDocument.java:138)
    at org.apache.poi.xwpf.extractor.XWPFWordExtractor.<init>(XWPFWordExtractor.java:60)
    at org.apache.poi.ooxml.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:228)
    at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:116)
    at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:110)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
    at org.apache.tika.Tika.parseToString(Tika.java:527)
    at org.apache.tika.Tika.parseToString(Tika.java:602)
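
For context on the message itself: on the JVM, "Could not initialize class" generally means the class's static initializer already failed on an earlier access, so the trace above may not be the first failure in the log. The sketch below illustrates that general behavior only. It is not taken from this report, and `InitFailureDemo` and `Fragile` are hypothetical names standing in for a class whose static initialization throws.

```java
public class InitFailureDemo {

    // Hypothetical stand-in for a class whose static initializer fails,
    // for example because a dependency on the classpath is missing or mismatched.
    static class Fragile {
        // Not a compile-time constant, so reading it triggers class initialization.
        static final int VALUE = compute();

        static int compute() {
            throw new IllegalStateException("simulated static-init failure");
        }
    }

    // Touches Fragile and reports which Throwable type the JVM raised.
    public static String touch() {
        try {
            return "ok: " + Fragile.VALUE;
        } catch (Throwable t) {
            return t.getClass().getName();
        }
    }

    public static void main(String[] args) {
        // The first access carries the real cause (ExceptionInInitializerError).
        System.out.println("first access:  " + touch());
        // Later accesses only report NoClassDefFoundError.
        System.out.println("second access: " + touch());
    }
}
```

If the same pattern applies here, an earlier `java.lang.ExceptionInInitializerError` in the Tomcat logs would name the underlying cause.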