Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-1490

pdf page => inline image not converted

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.7.1
    • Fix Version/s: 1.8.0
    • Component/s: Parsing
    • Labels:
    • Environment:
      JDK 6 32bit, Windows 7 prof 64-bit

      Description

      Some inline images from pdf page are be missed after converting page to image

      extracts of the logs are:

      [WARN] 2013-01-10 10:45:26,491 [main]
      org.apache.pdfbox.util.operator.SetNonStrokingColor - Not supported
      colorspace Indexed within operator scn
      [ERROR] 2013-01-10 10:45:26,507 [main]
      org.apache.pdfbox.pdmodel.graphics.xobject.PDPixelMap -
      java.lang.IndexOutOfBoundsException: Index: 12, Size: 12
      java.lang.IndexOutOfBoundsException: Index: 12, Size: 12
      at java.util.ArrayList.RangeCheck(ArrayList.java:547)
      at java.util.ArrayList.get(ArrayList.java:322)
      at org.apache.pdfbox.io.RandomAccessBuffer.seek(RandomAccessBuffer.java:84)
      at
      org.apache.pdfbox.io.RandomAccessFileOutputStream.write(RandomAccessFileOutputStream.java:116)
      at
      org.apache.pdfbox.filter.RunLengthDecodeFilter.decode(RunLengthDecodeFilter.java:92)
      at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:279)
      at org.apache.pdfbox.cos.COSStream.doDecode(COSStream.java:221)
      at org.apache.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:156)
      at
      org.apache.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:214)
      at org.apache.pdfbox.pdmodel.common.PDStream.getByteArray(PDStream.java:468)
      at
      org.apache.pdfbox.pdmodel.graphics.xobject.PDPixelMap.getRGBImage(PDPixelMap.java:185)
      at org.apache.pdfbox.util.operator.pagedrawer.Invoke.process(Invoke.java:83)
      at
      org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:554)
      at
      org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:268)
      at
      org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:235)
      at
      org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:215)
      at org.apache.pdfbox.pdfviewer.PageDrawer.drawPage(PageDrawer.java:125)
      at org.apache.pdfbox.pdmodel.PDPage.convertToImage(PDPage.java:757)
      .....
      [WARN] 2013-01-10 10:45:26,883 [main]
      org.apache.pdfbox.pdmodel.graphics.color.PDColorState - Unable to create
      the color instance [0.0, 0.0, 0.0] in color space Indexed{ }; guessing
      color ...
      Interpretating as RGB
      java.io.IOException: Not implemented
      at
      org.apache.pdfbox.pdmodel.graphics.color.PDIndexed.createColorSpace(PDIndexed.java:116)
      at
      org.apache.pdfbox.pdmodel.graphics.color.PDColorSpace.getJavaColorSpace(PDColorSpace.java:85)
      at
      org.apache.pdfbox.pdmodel.graphics.color.PDColorState.createColor(PDColorState.java:180)
      ....
      

      Full debug log messages are available under : http://pastebin.com/iMJS7guS
      The converted image can be seen under http://postimage.org/image/kl0hfrk0b/

      If more information is required, please let me know!
      Output from "WriteDecodedDoc" not possible because
      "java.lang.IndexOutOfBoundsException: Index: 12, Size: 12" has been
      thrown...

      Best Regards,
      Sascha

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                lehmi Andreas Lehmkühler
                Reporter:
                s.fischer Sascha Fischer
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: