  1. PDFBox
  2. PDFBOX-958

convertToImage mangles images which were in the PDF

Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 1.2.1, 1.4.0, 1.5.0
    • Fix Version/s: 1.6.0
    • Component/s: None
    • Labels: None
    • Environment: RHEL5 and WinXP, java version "1.6.0_23"

    Description

      Of the PDFs we've run through PDFBox to generate page images, a number of them (coming from disparate sources and methods of creation) produce page images in which an image embedded in the PDF page appears somewhat mangled. The embedded image seems to be divided into horizontal stripes: some stripes look normal, while others show some kind of "smearing" effect. See the attached images and original PDF (the images are of page 13).

      I marked this as critical because we are trying to use PDFBox in a project where page images are crucial, and the inability to produce reasonable-looking page images is pretty much a deal breaker.

      The code we use to extract the images looks more or less like the following:

      // Render the page to an image
      BufferedImage image = page.convertToImage();

      SmartDeferredFileOutputStream outStream = new SmartDeferredFileOutputStream();
      String[] writerFormatNames = ImageIO.getWriterFormatNames(); // not used below
      // Encode the rendered image; we also tried "png" here
      ImageIO.write(image, "jpeg", outStream);
      outStream.close();

      We've also tried specifying "png". In both the "jpeg" and "png" cases we get an image file that is indeed in the correct format, and both images look exactly the same, mangled stripes included.
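      Because the output looks the same in both formats, the encoding step is probably not where the damage happens. Below is a minimal sketch of one way to check that, using only the standard javax.imageio API. The class and method names (EncodingCheck, roundTrip, samePixels) are hypothetical, and a synthetic gradient stands in for the result of page.convertToImage(), so this is an illustration under those assumptions rather than our actual code.

```java
import java.awt.image.BufferedImage;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import javax.imageio.ImageIO;

// Hypothetical helper: encodes an image in memory, decodes it again, and
// compares pixels, to see whether the encoder alters the rendered image.
final class EncodingCheck {

    // Write the image in the given format and read it straight back.
    static BufferedImage roundTrip(BufferedImage image, String format) {
        try {
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            // ImageIO.write returns false when no writer handles the format
            if (!ImageIO.write(image, format, out)) {
                throw new IllegalArgumentException("No ImageIO writer for: " + format);
            }
            return ImageIO.read(new ByteArrayInputStream(out.toByteArray()));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // True when both images have the same size and identical ARGB pixels.
    static boolean samePixels(BufferedImage a, BufferedImage b) {
        if (a.getWidth() != b.getWidth() || a.getHeight() != b.getHeight()) {
            return false;
        }
        for (int y = 0; y < a.getHeight(); y++) {
            for (int x = 0; x < a.getWidth(); x++) {
                if (a.getRGB(x, y) != b.getRGB(x, y)) {
                    return false;
                }
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Assumption: a synthetic gradient stands in for page.convertToImage()
        BufferedImage rendered = new BufferedImage(64, 64, BufferedImage.TYPE_INT_RGB);
        for (int y = 0; y < 64; y++) {
            for (int x = 0; x < 64; x++) {
                rendered.setRGB(x, y, (x * 4 << 16) | (y * 4 << 8) | 0x80);
            }
        }
        BufferedImage decoded = roundTrip(rendered, "png");
        System.out.println("PNG round trip lossless: " + samePixels(rendered, decoded));
    }
}
```

      If the same check passes on a real rendered page, the stripes would already be present in the BufferedImage returned by convertToImage, which would point at the rendering step rather than at ImageIO.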

      Attachments

        1. Image of Page 13.jpeg
          266 kB
          Eric Schwarzenbach
        2. Image of Page 13.png
          752 kB
          Eric Schwarzenbach
        3. PDFBOX958-WrycanLoremIpsumTest.pdf
          1.83 MB
          Andreas Lehmkühler

          People

            Assignee: Andreas Lehmkühler (lehmi)
            Reporter: Eric Schwarzenbach (ericjs)