Details
Type: Bug
Status: Closed
Priority: Minor
Resolution: Fixed
Description
[imported from SourceForge]
http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1755543
Originally submitted by taywin on 2007-07-17 08:52.
I have been trying to extract text content from some PDF files that were created by Omnipage CaptureSDK V12.7, but Java threw the exception shown below. Is the problem in my files, or is it in the fontbox library? Any comments would be appreciated. Thank you.
Microsoft Windows XP [Version 5.1.2600]
(C) Copyright 1985-2001 Microsoft Corp.
C:\Documents and Settings\Owner>cd\intern-search\mysearch
C:\Intern-Search\mysearch>jrun-lucene MyPDF2
PDF File name (no extension): test
------ Text Data ------
Page number: 1
Exception in thread "main" java.lang.NoSuchMethodError: org.fontbox.cmap.CMapParser.parse(Ljava/lang/String;Ljava/io/InputStream;)Lorg/fontbox/cmap/CMap;
    at org.pdfbox.pdmodel.font.PDFont.parseCmap(PDFont.java:509)
    at org.pdfbox.pdmodel.font.PDFont.encode(PDFont.java:380)
    at org.pdfbox.util.PDFStreamEngine.showString(PDFStreamEngine.java:325)
    at org.pdfbox.util.operator.ShowText.process(ShowText.java:64)
    at org.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:452)
    at org.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:215)
    at org.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:174)
    at org.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:336)
    at org.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:259)
    at org.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:216)
    at MyPDF2.processDocument(MyPDF2.java:133)
    at MyPDF2.main(MyPDF2.java:82)
C:\Intern-Search\mysearch>
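A NoSuchMethodError is raised when a class that is found at run time lacks a method the caller was compiled against. Here the caller (PDFont.parseCmap) expects CMapParser.parse(String, InputStream) returning CMap. One possible explanation, not confirmed in this report, is that the fontbox jar on the classpath is a different version from the one pdfbox was built against. The following is a minimal diagnostic sketch in modern Java under that assumption; the class name MethodCheck and its helper methods are hypothetical and are not part of pdfbox or fontbox. It prints where CMapParser was loaded from and which public parse overloads it actually exposes.

```java
import java.lang.reflect.Method;
import java.security.CodeSource;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of pdfbox or fontbox.
public class MethodCheck {

    /**
     * Returns the signatures of all public methods named methodName on
     * className, or an empty list if the class cannot be loaded.
     */
    static List<String> findSignatures(String className, String methodName) {
        List<String> found = new ArrayList<>();
        try {
            Class<?> cls = Class.forName(className);
            for (Method m : cls.getMethods()) {
                if (m.getName().equals(methodName)) {
                    found.add(m.toString());
                }
            }
        } catch (ClassNotFoundException | LinkageError e) {
            // Class is missing or failed to link: report no methods.
        }
        return found;
    }

    /** Returns the jar or directory the class was loaded from, if known. */
    static String findLocation(String className) {
        try {
            CodeSource src = Class.forName(className)
                    .getProtectionDomain().getCodeSource();
            return src == null
                    ? "(bootstrap or unknown)"
                    : String.valueOf(src.getLocation());
        } catch (ClassNotFoundException | LinkageError e) {
            return "(class not found)";
        }
    }

    public static void main(String[] args) {
        // Class and method names are taken from the stack trace above.
        String cls = "org.fontbox.cmap.CMapParser";
        System.out.println("Loaded from: " + findLocation(cls));
        for (String sig : findSignatures(cls, "parse")) {
            System.out.println("  " + sig);
        }
    }
}
```

Run it with the same classpath as MyPDF2. If no parse(String, InputStream) overload is listed, the loaded fontbox jar does not provide the method pdfbox expects, which would point to the libraries rather than to the PDF file.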
[attachment on SourceForge]
http://sourceforge.net/tracker/download.php?group_id=78314&atid=552832&aid=1755543&file_id=237366
test.pdf (application/pdf), 24097 bytes
Omnipage CaptureSDK V12.7 patent file