Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-1220

CMYK image cannot be extracted (empty file generated)

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 1.6.0
    • 1.7.0
    • None

    Description

      Extracting the attached PDF with the following command:
      java -jar app/target/pdfbox-app-1.7.0-SNAPSHOT.jar PDFToImage -imageType png /tmp/WO2011140338-Page25.pdf

      This generates image WO2011140338-Page251.png (attached).

      This is a great improvement over pdfbox 1.6.0, which generates four copies of the image (maybe because of the CMYK encoding?). Well done!

      However the image is still of quite poor quality, apparently lower than the actual image data in the PDF, when displayed with acrobat reader or evince 2.32.0 (screenshot attached too). It would be great if that could be fixed too.

      Attachments

        1. PDFBOX1220-WO2011140338-Page251.png
          196 kB
          Andreas Lehmkühler
        2. WO2011140338-Page251.png
          36 kB
          Daniel Bonniot de Ruisselet
        3. WO2011140338-Page25.pdf
          178 kB
          Daniel Bonniot de Ruisselet
        4. Screenshot-evince-WO2011140338 Page25.PDF.png
          340 kB
          Daniel Bonniot de Ruisselet

        Activity

          People

            lehmi Andreas Lehmkühler
            dbr Daniel Bonniot de Ruisselet
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: