[PDFBOX-1508] Extracting page causes incorrect clipping - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Not A Problem
Affects Version/s: 1.7.1
Fix Version/s: None
Component/s: Parsing, Swing GUI
Labels:
None
Environment:
Windows 7, Windows XP, Windows Server 2008

Description

I have a compressed pdf from which i extract pages (each page will become an individual pdf file). The extracted pages are clipped incorrectly (text is cut), as opposed to original pdf that is not clipped. I traced it down to a missing mediabox attribute in the extracted pages, which exists in the original file as an attribute on all pages. Using the same file, but uncompressed, the extracted pages are not cut and the mediabox attribute is present.

The main code (without initializations and checks) used to load and extract pages is the following:

temp = new File("e:/temp.tmp");
rand = new RandomAccessFile(temp,"rw");
doc = PDDocument.loadNonSeq(file,rand);
PDPage page = (PDPage) doc.getPrintable(pageIndex);
PDDocument newDoc = new PDDocument();
newDoc.importPage(page);
newDoc.close();
doc.close();
rand.close();
temp.delete();

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

files.zip
05/Feb/13 13:27
200 kB
Adina Toma

Activity

People

Assignee:: Andreas Lehmkühler

Reporter:: Adina Toma

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 05/Feb/13 13:24

Updated:: 21/Mar/13 09:58

Resolved:: 09/Mar/13 15:40