Description
The most common exception in the first run of 2.0.0-trunk against govdocs1 is this:
java.lang.NullPointerException
	at org.apache.pdfbox.pdmodel.interactive.form.PDNonTerminalField.getValueAsString(PDNonTerminalField.java:181)
	at org.apache.tika.parser.pdf.PDF2XHTML.addFieldString(PDF2XHTML.java:615)
	at org.apache.tika.parser.pdf.PDF2XHTML.processAcroField(PDF2XHTML.java:580)
	at org.apache.tika.parser.pdf.PDF2XHTML.extractAcroForm(PDF2XHTML.java:567)
	at org.apache.tika.parser.pdf.PDF2XHTML.endDocument(PDF2XHTML.java:201)
	at org.apache.pdfbox.text.PDFTextStripper.writeText(PDFTextStripper.java:250)
	at org.apache.tika.parser.pdf.PDF2XHTML.process(PDF2XHTML.java:137)
	at org.apache.tika.parser.pdf.PDFParser.parse(PDFParser.java:132)
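The trace shows the NullPointerException surfacing inside PDNonTerminalField.getValueAsString when it is called from PDF2XHTML.addFieldString. One possible caller-side workaround, sketched here only as an illustration, is to treat an NPE from the value lookup as "no value" so the rest of the form can still be extracted. The FieldLike interface and the SafeFieldValue helper below are hypothetical stand-ins invented for this sketch; they are not PDFBox or Tika classes, and this is not necessarily how the issue was resolved.

```java
// Hypothetical stand-in for a form field exposing a string value.
// This is NOT the PDFBox PDField class; it only mirrors the one method
// named in the stack trace so the sketch compiles on its own.
interface FieldLike {
    String getValueAsString();
}

class SafeFieldValue {
    // Returns the field's value, or null if the field is null or the
    // lookup itself throws a NullPointerException.
    static String valueOrNull(FieldLike field) {
        if (field == null) {
            return null;
        }
        try {
            return field.getValueAsString();
        } catch (NullPointerException e) {
            // Assumption: a missing value is acceptable here, so the
            // caller can skip this field and continue with the others.
            return null;
        }
    }

    public static void main(String[] args) {
        FieldLike ok = () -> "Jane Doe";
        FieldLike broken = () -> { throw new NullPointerException(); };
        System.out.println(valueOrNull(ok));      // value is returned unchanged
        System.out.println(valueOrNull(broken));  // NPE is swallowed, null returned
    }
}
```

Swallowing the exception hides the underlying defect, so a guard like this would only be a stopgap on the calling side until the null handling in the library itself is addressed.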
Attachments
Issue Links
- is related to TIKA-1285 Upgrade to PDFBox 2.0.0 when available (Closed)