Details
-
Bug
-
Status: Closed
-
Major
-
Resolution: Fixed
-
2.0.2, 2.0.6
-
None
-
Windows 7
Description
The PDFTextStripperByArea.extractRegions fails for the first page of the attached document.
java.io.IOException: Error: Could not find referenced cmap stream H
at org.apache.fontbox.cmap.CMapParser.getExternalCMap(CMapParser.java:415)
at org.apache.fontbox.cmap.CMapParser.parsePredefined(CMapParser.java:87)
at org.apache.pdfbox.pdmodel.font.CMapManager.getPredefinedCMap(CMapManager.java:54)
at org.apache.pdfbox.pdmodel.font.PDType0Font.readEncoding(PDType0Font.java:181)
at org.apache.pdfbox.pdmodel.font.PDType0Font.<init>(PDType0Font.java:129)
at org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:83)
at org.apache.pdfbox.pdmodel.PDResources.getFont(PDResources.java:123)
at org.apache.pdfbox.contentstream.operator.text.SetFontAndSize.process(SetFontAndSize.java:60)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:815)
at com.geensoft.traceability.converters.pdf.strippers.AreaStripper.processOperator(AreaStripper.java:324)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:472)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:446)
at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:149)
at org.apache.pdfbox.text.PDFTextStreamEngine.processPage(PDFTextStreamEngine.java:136)
at org.apache.pdfbox.text.PDFTextStripper.processPage(PDFTextStripper.java:391)
at org.apache.pdfbox.text.PDFTextStripperByArea.extractRegions(PDFTextStripperByArea.java:132)