Details
- Improvement
- Status: Closed
- Major
- Resolution: Fixed
- 2.0.24, 3.0.0 PDFBox
- Patch
Description
For some PDF files, such as the one I attached, some text is missing from the text extraction output. After some debugging and testing, I found that this can be fixed by also using the cmap from the TrueTypeFont.
I will submit a patch soon.
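To illustrate the idea, here is a minimal plain-Java sketch of the fallback, not the actual patch. It assumes the character code equals the glyph ID and models both tables as ordinary maps. The class `CmapFallbackSketch` and its methods are hypothetical names, not PDFBox API.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

public class CmapFallbackSketch {

    /**
     * Inverts a TrueType-style cmap (Unicode code point -> glyph ID) so that
     * glyph IDs can be mapped back to text. If several code points share a
     * glyph, the smallest code point is kept (an arbitrary but stable choice).
     */
    public static Map<Integer, Integer> invert(Map<Integer, Integer> codePointToGlyph) {
        Map<Integer, Integer> glyphToCodePoint = new HashMap<>();
        for (Map.Entry<Integer, Integer> e : codePointToGlyph.entrySet()) {
            glyphToCodePoint.merge(e.getValue(), e.getKey(), Math::min);
        }
        return glyphToCodePoint;
    }

    /**
     * Looks up the text for a glyph: first in the (possibly incomplete)
     * ToUnicode-style map, then, as a fallback, in the inverted font cmap.
     */
    public static Optional<String> toUnicode(int glyphId,
                                             Map<Integer, String> toUnicode,
                                             Map<Integer, Integer> glyphToCodePoint) {
        String mapped = toUnicode.get(glyphId);
        if (mapped != null) {
            return Optional.of(mapped);
        }
        Integer codePoint = glyphToCodePoint.get(glyphId);
        if (codePoint != null) {
            // Fallback: the embedded font itself knows which character the glyph draws.
            return Optional.of(new String(Character.toChars(codePoint)));
        }
        return Optional.empty(); // no mapping anywhere; the text stays missing
    }

    public static void main(String[] args) {
        Map<Integer, String> toUnicode = new HashMap<>();
        toUnicode.put(36, "A");        // glyph 36 is covered by the ToUnicode-style map

        Map<Integer, Integer> cmap = new HashMap<>();
        cmap.put(0x42, 37);            // 'B' -> glyph 37, known only to the font cmap

        Map<Integer, Integer> inverted = invert(cmap);
        System.out.println(toUnicode(36, toUnicode, inverted));
        System.out.println(toUnicode(37, toUnicode, inverted));
        System.out.println(toUnicode(99, toUnicode, inverted));
    }
}
```

Without the second lookup, glyph 37 would produce no text at all, which matches the symptom described above.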
Attachments
Issue Links
- is related to
  - PDFBOX-5331 Text "820-01869-U-A" is omitted from PDF doc (Closed)