Description
I get this with the attached file when using the non-sequential parser only:
Exception in thread "main" java.io.IOException: Error: Expected a long type at offset 1218571, instead got 'xref' at org.apache.pdfbox.pdfparser.BaseParser.readLong(BaseParser.java:1689) at org.apache.pdfbox.pdfparser.BaseParser.readObjectNumber(BaseParser.java:1617) at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.parseXrefObjStream(NonSequentialPDFParser.java:746) at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.parseXref(NonSequentialPDFParser.java:697) at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.initialParse(NonSequentialPDFParser.java:480) at org.apache.pdfbox.pdfparser.NonSequentialPDFParser.parse(NonSequentialPDFParser.java:1013) at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:951)
Attachments
Attachments
Issue Links
- relates to
-
TIKA-1442 Upgrade to PDFBox 1.8.8
- Closed