PDFBox / PDFBOX-1508

Extracting page causes incorrect clipping

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Not A Problem
    • Affects Version/s: 1.7.1
    • Fix Version/s: None
    • Component/s: Parsing, Swing GUI
    • Labels: None
    • Environment: Windows 7, Windows XP, Windows Server 2008

      Description

      I have a compressed PDF from which I extract pages (each page becomes an individual PDF file). The extracted pages are clipped incorrectly (text is cut off), whereas the original PDF is not clipped. I traced this to a missing MediaBox attribute in the extracted pages; in the original file the attribute is present on every page. When I use the same file uncompressed, the extracted pages are not cut off and the MediaBox attribute is present.

      The main code (without initializations and checks) used to load and extract pages is the following:

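      // `file` (the source PDF) and `pageIndex` are initialized elsewhere.
      File temp = new File("e:/temp.tmp");                      // scratch file for the parser
      RandomAccessFile rand = new RandomAccessFile(temp, "rw");
      PDDocument doc = PDDocument.loadNonSeq(file, rand);       // non-sequential parser
      PDPage page = (PDPage) doc.getPrintable(pageIndex);       // page to extract
      PDDocument newDoc = new PDDocument();
      newDoc.importPage(page);                                  // copy the page into the new document
      // Release resources and remove the scratch file.
      newDoc.close();
      doc.close();
      rand.close();
      temp.delete();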
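
The report does not establish why MediaBox goes missing. One mechanism that can produce this symptom in general is attribute inheritance: in a PDF page tree, MediaBox may be stored on a parent node and inherited by its pages, so copying only a page's own entries drops it. The sketch below is a toy model in plain Java, not PDFBox code; the class `InheritedMediaBox`, its `Node` type, and `extract` are hypothetical names chosen for illustration. The method names mirror the distinction PDFBox 1.x draws between `getMediaBox()` (direct entry) and `findMediaBox()` (inherited lookup).

```java
import java.util.Arrays;

/** Toy model of a PDF page-tree node; hypothetical, not part of PDFBox. */
public class InheritedMediaBox {

    static final class Node {
        final Node parent;      // null for the root of the page tree
        final float[] mediaBox; // null when the node has no MediaBox entry of its own

        Node(Node parent, float[] mediaBox) {
            this.parent = parent;
            this.mediaBox = mediaBox;
        }

        /** Direct lookup: returns only this node's own entry. */
        float[] getMediaBox() {
            return mediaBox;
        }

        /** Inherited lookup: walks up the parents until an entry is found. */
        float[] findMediaBox() {
            for (Node n = this; n != null; n = n.parent) {
                if (n.mediaBox != null) {
                    return n.mediaBox;
                }
            }
            return null;
        }
    }

    /** Copies a page into a new, parentless tree, resolving the box first. */
    static Node extract(Node page) {
        return new Node(null, page.findMediaBox());
    }

    public static void main(String[] args) {
        Node root = new Node(null, new float[] {0, 0, 595, 842}); // box stored on the parent
        Node page = new Node(root, null);                          // page relies on inheritance

        Node naive = new Node(null, page.getMediaBox()); // copies the direct entry only
        Node fixed = extract(page);                      // copies the resolved value

        System.out.println(Arrays.toString(naive.getMediaBox()));
        System.out.println(Arrays.toString(fixed.getMediaBox()));
    }
}
```

Under this model, the naive copy ends up without a box while the resolved copy keeps it. Applied to the code in the report, the analogous step would be to set the imported page's MediaBox from the source page's resolved value after `importPage`; whether that applies to the attached file is an assumption.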

        Attachments

        1. files.zip
          200 kB
          Adina Toma

            People

            • Assignee:
              Andreas Lehmkühler (lehmi)
            • Reporter:
              Adina Toma (actoma)
            • Votes:
              0
            • Watchers:
              2
