Details
Type: Improvement
Status: Closed
Priority: Major
Resolution: Fixed
1.6.0
None
Environment: Fedora 13 Linux
Description
Apparently a PDF can contain multiple files (like a Zip file); this is called
a PDF Package, described at
http://help.adobe.com/en_US/Reader/8.0/help.html?content=WSE034CA46-D08F-4fff-AA3C-FF04510DAEF0.html
I have a simple example PDF Package containing two sub-PDFs, but ExtractText
fails to extract their text.
It runs successfully (no exceptions), but the only text it extracts is the boilerplate
message saying you should upgrade to Adobe Acrobat version 8 or later to view this PDF.
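
As a rough way to tell whether a given file is a PDF Package before running text extraction on it, the sketch below scans the raw bytes for the `/Collection` and `/EmbeddedFiles` names that the PDF format uses for packages and embedded-file name trees. This is only a heuristic and not how ExtractText works: the function name `looks_like_pdf_package` and the sample byte strings are made up for illustration, and the check will miss files whose catalog is stored inside a compressed object stream.

```python
def looks_like_pdf_package(data: bytes) -> bool:
    """Heuristic check: does this PDF mention embedded-file structures?

    Assumption: the catalog is stored uncompressed, so the names appear
    literally in the file. A real parser is needed for the general case.
    """
    # Readers typically accept the header anywhere in the first 1024 bytes.
    if b"%PDF-" not in data[:1024]:
        return False
    return b"/Collection" in data or b"/EmbeddedFiles" in data


# Hypothetical minimal samples (not complete, valid PDFs).
plain = b"%PDF-1.4\n1 0 obj\n<< /Type /Catalog /Pages 2 0 R >>\nendobj\n"
package = (
    b"%PDF-1.7\n1 0 obj\n"
    b"<< /Type /Catalog /Pages 2 0 R /Collection << /View /D >> "
    b"/Names << /EmbeddedFiles 3 0 R >> >>\nendobj\n"
)

print(looks_like_pdf_package(plain))    # False
print(looks_like_pdf_package(package))  # True
```

If this returns True for a file where only the "upgrade to Acrobat 8" text comes out, the real content is most likely in the embedded sub-PDFs rather than in the pages of the outer document.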