Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
1.8.8, 2.0.0
-
None
-
None
Description
ExtractText does not return the text content of this PDF. There are just a few real characters when running 1.8.8, and none with today's 2.0.0 snapshot.
I also attach the output from pdftotext 0.26.5 (from poppler-utils), which seems to get it mostly right.
Attachments
Attachments
Issue Links
- is related to
-
PDFBOX-2272 Can't extract vertical text correctly
- Open
- relates to
-
PDFBOX-2509 Korean Text font substitution issues
- Closed