[PDFBOX-1152] Gets scrambled Japanese text while reading a PDF file - ASF JIRA

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 1.6.0
Fix Version/s: 2.0.0
Component/s: Text extraction
Labels:
- PDFBox
Environment:
Windows XP Service Pack 3, P4, 1GB

Description

During conversion of a Japanese PDF file to XML, the Japanese text in the output is scrambled.

Attachments

- SamplePDF.pdf, 31/Oct/11 05:37, 8 kB, Suresh Somanathan
- SamplePDF.xml, 31/Oct/11 05:37, 0.1 kB, Suresh Somanathan

People

Assignee: Andreas Lehmkühler
Reporter: Suresh Somanathan
Votes: 0
Watchers: 1

Dates

Created: 31/Oct/11 05:27
Updated: 17/Mar/16 19:07
Resolved: 23/Oct/14 17:03

Time Tracking

Estimated: 24h
Remaining: 24h
Logged: Not Specified