Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
1.28.1, 2.3.0
-
None
-
None
-
The problem can be reproduced under (Windows + Java8). However, the problem does not appear to be environment specific.
Description
Attempting to parse the attached "map.pdf" causes the Tika parser to hang due to an infinite loop involving "PDFStreamParser" logic.
This problem occurs in both tika-app 1.28.1 and 2.3.0.
It is also worth noting that Acrobat itself will become unresponsive if attempting to open this document.
To reproduce the problem, just run:
java -jar tika-app-1.28.1.jar map.pdf
Attachments
Attachments
Issue Links
- is related to
-
PDFBOX-5415 Infinite loop in ExtractText in 2.x branch on a specific pdf
- Open