[PDFBOX-955] Can't extract b/w images from PDF - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Improvement
Status: Closed
Priority: Minor
Resolution: Fixed
Affects Version/s: 1.4.0
Fix Version/s: 1.6.0
Component/s: None
Labels:
- extract
Environment:
Windows XP prof, Java 1.6.0_22, Netbeans 6.9.1

Description

I wrote a test application using org.apache.pdfbox.ExtractImages to... extract images as PNG. (This is the start of something bigger, which involves making a statistic about the content of over a million pages within PDF files) However all images I get are all black or all white when I test on our own PDF files. I did get correct images from a file that had color images. To extract, I tried page.convertToImage() and then writing with ImageIO.write(), but I also tried using PDFImageWriter, neither had success for b/w images.

The sample PDF is not confidential; it does give a warning "getRGBImage returned NULL" but other PDFs that don't give the warning (but are confidential) also fail.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

ccitt4-cib-test.pdf
14/Aug/13 17:39
10 kB
Tilman Hausherr
ccitt4-cib-test-01.png
14/Aug/13 17:39
20 kB
Tilman Hausherr
d0000040.pdf
02/Feb/11 14:00
8 kB
Tilman Hausherr
d0000040-01.png
02/Feb/11 14:00
9 kB
Tilman Hausherr
ExtractImages.java
02/Feb/11 14:00
4 kB
Tilman Hausherr
PDFBOX955-d00000401.png
16/Jun/11 17:28
44 kB
Andreas Lehmkühler
PDFBOX955-photo1.png
16/Jun/11 17:28
337 kB
Andreas Lehmkühler
photo.jpg
11/May/11 10:03
14 kB
Roel Pieters
photo.pdf
11/May/11 10:03
333 kB
Roel Pieters

Issue Links

is duplicated by

PDFBOX-1018 Remove imageIO dependency (was: PDPage convertToImage bug creates white images from black and white pdf files.)

Closed

PDFBOX-794 PDPage convertToImage generates white image with no contents

Closed

Activity

People

Assignee:: Andreas Lehmkühler

Reporter:: Tilman Hausherr

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 02/Feb/11 13:57

Updated:: 18/Aug/13 16:35

Resolved:: 18/Aug/13 16:35