[PDFBOX-1340] i got wrong characters when i extract some chinese pdf files - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 1.7.0
Fix Version/s: 1.7.1
Component/s: Text extraction
Labels:
None
Environment:
windows, java1.6

Description

for pdfbox1.6,i can extract the right chinese, but some pages are not right, so i transform to pdfbox1.7, but the version get all wrong, it seems like Korean

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

tt.unc.pdf
14/Jun/12 08:32
445 kB
linqiang
ASF.LICENSE.NOT.GRANTED--screenshot-1.jpg
14/Jun/12 08:36
184 kB
linqiang

Activity

People

Assignee:: Andreas Lehmkühler

Reporter:: linqiang

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 14/Jun/12 08:29

Updated:: 25/Jul/12 06:01

Resolved:: 15/Jul/12 16:12