Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-1220

CMYK image cannot be extracted (empty file generated)

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.6.0
    • Fix Version/s: 1.7.0
    • Component/s: None
    • Labels:

      Description

      Extracting the attached PDF with the following command:
      java -jar app/target/pdfbox-app-1.7.0-SNAPSHOT.jar PDFToImage -imageType png /tmp/WO2011140338-Page25.pdf

      This generates image WO2011140338-Page251.png (attached).

      This is a great improvement over pdfbox 1.6.0, which generates four copies of the image (maybe because of the CMYK encoding?). Well done!

      However the image is still of quite poor quality, apparently lower than the actual image data in the PDF, when displayed with acrobat reader or evince 2.32.0 (screenshot attached too). It would be great if that could be fixed too.

        Attachments

        1. WO2011140338-Page251.png
          36 kB
          Daniel Bonniot de Ruisselet
        2. WO2011140338-Page25.pdf
          178 kB
          Daniel Bonniot de Ruisselet
        3. Screenshot-evince-WO2011140338 Page25.PDF.png
          340 kB
          Daniel Bonniot de Ruisselet
        4. PDFBOX1220-WO2011140338-Page251.png
          196 kB
          Andreas Lehmkühler

          Activity

            People

            • Assignee:
              lehmi Andreas Lehmkühler
              Reporter:
              dbr Daniel Bonniot de Ruisselet
            • Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: