Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
2.0.32, 3.0.3 PDFBox, 4.0.0
Description
As reported by Pascal Schumacher in the users mailing list
https://lists.apache.org/thread/yb42j9s5vp8jsjog9msplbc05y1xqwv3
java.lang.IllegalArgumentException: Parameter must be 1-based, but is 0
at org.apache.pdfbox.text.PDFTextStripper.setStartPage(PDFTextStripper.java:956)
at org.apache.pdfbox.text.PDFTextStripperByArea.extractRegions(PDFTextStripperByArea.java:117)
this is because of this earlier seemingly "harmless" commit
https://github.com/apache/pdfbox/commit/5c0abf94367c12c9ac0b464046784d456ce4caf5
that broke PDFTextStripperByArea because it has two calls with 0 parameter.
This wasn't discovered because we have no tests for PDFTextStripperByArea 😬
Attachments
Issue Links
- is related to
-
TIKA-4296 "Parameter must be 1-based, but is -1" when using Tika with PDFBox 2.0.32
- Resolved