Details
- Type: Bug
- Status: Closed
- Priority: Major
- Resolution: Fixed
- Affects Version/s: 1.8.8, 2.0.0
- Fix Version/s: 2.0.0
- Component/s: None
- Labels: None
Description
ExtractText does not return the text content of this PDF. With 1.8.8 the output contains only a few real characters, and with today's 2.0.0 snapshot it contains none.
I have also attached the output from pdftotext 0.26.5 (from poppler-utils), which seems to get it mostly right.
Attachments
Issue Links
- is related to: PDFBOX-2272 Can't extract vertical text correctly (Open)
- relates to: PDFBOX-2509 Korean Text font substitution issues (Closed)