Details
- Type: Bug
- Status: Open
- Priority: Major
- Resolution: Unresolved
- Affects Version/s: 1.8.6, 2.0.0
- None
- None
Description
- 1.8.6 can't extract the Unicode text because it fails to map the UCS2 CMap for 90ms-RKSJ-V.
- 2.0 extracts the text but can't handle the vertical layout.
Also see the file from PDFBOX-2294 which contains both horizontal and vertical text.
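To make "handling the vertical layout" concrete, here is a minimal sketch in plain Java. It does not use any PDFBox classes and is not how PDFBox implements extraction. The `Glyph` class, the `readVertical` method, and the `columnWidth` parameter are hypothetical names made up for illustration. The sketch assumes each glyph comes with page coordinates where y grows upward, and that glyphs in one column share roughly the same x. The only fact taken from the PDF predefined CMap naming is that names ending in "-V" (such as 90ms-RKSJ-V) denote vertical writing mode and names ending in "-H" denote horizontal.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class VerticalTextSketch {

    /** Hypothetical glyph record: extracted text plus its position on the page. */
    public static class Glyph {
        public final String text;
        public final double x;
        public final double y;

        public Glyph(String text, double x, double y) {
            this.text = text;
            this.x = x;
            this.y = y;
        }
    }

    /** Predefined CMap names end in "-V" for vertical writing mode. */
    public static boolean isVerticalCMap(String cmapName) {
        return cmapName != null && cmapName.endsWith("-V");
    }

    /**
     * Orders glyphs for vertical reading: columns from right to left,
     * and top to bottom inside each column. Columns are separated by
     * a newline in the result. columnWidth is an assumed bucket size
     * used to decide which glyphs belong to the same column.
     */
    public static String readVertical(List<Glyph> glyphs, double columnWidth) {
        List<Glyph> sorted = new ArrayList<>(glyphs);
        sorted.sort((a, b) -> {
            long colA = Math.round(a.x / columnWidth);
            long colB = Math.round(b.x / columnWidth);
            if (colA != colB) {
                return Long.compare(colB, colA); // rightmost column first
            }
            return Double.compare(b.y, a.y);     // higher y (top of page) first
        });

        StringBuilder out = new StringBuilder();
        boolean first = true;
        long previousColumn = 0;
        for (Glyph g : sorted) {
            long column = Math.round(g.x / columnWidth);
            if (!first && column != previousColumn) {
                out.append('\n'); // start of the next column
            }
            out.append(g.text);
            previousColumn = column;
            first = false;
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Two columns of two glyphs each, supplied in scrambled order.
        List<Glyph> glyphs = Arrays.asList(
            new Glyph("D", 180, 680),
            new Glyph("A", 200, 700),
            new Glyph("C", 180, 700),
            new Glyph("B", 200, 680));
        System.out.println(isVerticalCMap("90ms-RKSJ-V"));
        System.out.println(readVertical(glyphs, 20));
    }
}
```

A horizontal extractor that sorts by y first and x second would interleave the two columns line by line, which is the kind of wrong ordering reported here for vertical text.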
Attachments
Issue Links
- is duplicated by: PDFBOX-2879 Wrong vertical text extraction for apache PDFBox 2.0.0 (Closed)
- is related to: PDFBOX-800 Wrong text extract from vertical textboxes in pdf files (Open)
- relates to: PDFBOX-2711 Japanese text not extracted (Closed)