Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Not A Problem
-
1.2.1
-
None
-
None
-
None
-
ubuntu 10.4
java version "1.6.0_20"
Java(TM) SE Runtime Environment (build 1.6.0_20-b02)
Java HotSpot(TM) 64-Bit Server VM (build 16.3-b01, mixed mode)
Description
some fields in PDDocumentInformation are unreadable.
ex. parsing this pdf from http://delhigovt.nic.in/election/delhi/U05/A061EN/A0610042.pdf, I got the following
TITLE: 9¸Ð1Ü
AUTHOR: )©Ë&ÄãJá7eà
CREATOR:8Ð!âä/f²Inmï¥kX¿
KEYWORDS:null
producer: )®Í*Ã- |é0~û$“H'7¯µpC¥ÞQKÄÜÞ
SUBJECT:null
trapped: null
Attachments
Attachments
Issue Links
- blocks
-
TIKA-389 Garbled metadata when dealing with encrypted PDF files.
- Closed
- is duplicated by
-
PDFBOX-858 Metadata extraction broken on some PDF files
- Closed