PDFBox / PDFBOX-602

PDFBox performance issue: PDPage tweaks


Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 0.8.0-incubator
    • Fix Version/s: 1.0.0
    • Component/s: PDModel
    • Labels: None
    • Environment: All

    Description

      During parsing and text extraction, the PDPage class's getMediaBox() and getParent() methods are called many times. They are currently slow because the code recalculates the result on every call.

      The results should instead be cached in instance fields so that they are calculated only once, and recalculated only when the underlying object state changes.

      I will attach a version of the file that includes the fix.
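
The caching idea described above can be sketched roughly as follows. This is a minimal illustration only, not the attached PDPage.java and not the PDFBox API: the `PageNode` class, its map-backed dictionary, and the `lookups` counter are hypothetical stand-ins chosen to show a lazily filled instance field that is cleared whenever the node's own state is changed.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical stand-in for a page-tree node; this is NOT the PDFBox PDPage class. */
class PageNode {
    // Simplified stand-in for the underlying page dictionary.
    private final Map<String, float[]> dictionary = new HashMap<>();
    private final PageNode parent;

    // Cached result of the inherited lookup; null means "not computed yet".
    private float[] cachedMediaBox;

    // Counts how often the slow path runs (for demonstration only).
    int lookups = 0;

    PageNode(PageNode parent) {
        this.parent = parent;
    }

    PageNode getParent() {
        return parent;
    }

    /** Returns the media box, walking up the tree only on a cache miss. */
    float[] getMediaBox() {
        if (cachedMediaBox == null) {
            cachedMediaBox = findInheritedMediaBox();
        }
        return cachedMediaBox;
    }

    /** Changes this node's own entry and invalidates the cached value. */
    void setMediaBox(float[] box) {
        if (box == null) {
            dictionary.remove("MediaBox");
        } else {
            dictionary.put("MediaBox", box);
        }
        cachedMediaBox = null;
    }

    // Slow path: search this node and then its ancestors for a media box.
    private float[] findInheritedMediaBox() {
        lookups++;
        for (PageNode node = this; node != null; node = node.parent) {
            float[] box = node.dictionary.get("MediaBox");
            if (box != null) {
                return box;
            }
        }
        return null; // nothing found; the next call will search again
    }
}
```

In this sketch, repeated calls to `getMediaBox()` on the same node perform the tree walk once, and `setMediaBox()` on that node forces the next call to recompute. One limitation worth noting: a change made on an ancestor does not clear the caches of its descendants, so a real implementation would need some way to detect or propagate such changes.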

      Attachments

        1. PDPage.java
          25 kB
          Mel Martinez


          People

            Assignee: Jukka Zitting (jukkaz)
            Reporter: Mel Martinez (m.martinez)
