Tika / TIKA-1375

Decrease memory consumption when extracting images from PDFs

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.6
    • Component/s: parser
    • Labels: None

      Description

      This patch applies the changes made in PDFBOX-2101 to reduce memory consumption while extracting embedded images. It also applies Tilman Hausherr's recommendation, made on the PDFBox dev list, to clear resources after handling each page.
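
A minimal sketch of the "clear resources after each page" idea follows. It is not the patch itself: `Page`, `images()`, `clear()` and `extractAll()` are hypothetical stand-ins invented for illustration, not PDFBox or Tika API. The point is only that each page's cached image data is released in a `finally` block before the next page is touched, so at most one page's images are held at a time.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class PageImageExtraction {

    /** Hypothetical stand-in for a PDF page that lazily caches decoded images. */
    static class Page {
        private final int imageCount;
        private List<byte[]> cachedImages; // filled on first access, dropped by clear()

        Page(int imageCount) {
            this.imageCount = imageCount;
        }

        List<byte[]> images() {
            if (cachedImages == null) {
                cachedImages = new ArrayList<>();
                for (int i = 0; i < imageCount; i++) {
                    cachedImages.add(new byte[1024]); // pretend this is decoded image data
                }
            }
            return cachedImages;
        }

        /** Drops the cached image data so it can be garbage collected. */
        void clear() {
            cachedImages = null;
        }

        boolean holdsCache() {
            return cachedImages != null;
        }
    }

    /** Visits pages one at a time and releases each page's cache before moving on. */
    static int extractAll(List<Page> pages) {
        int extracted = 0;
        for (Page page : pages) {
            try {
                // A real extractor would hand each image to a handler here.
                extracted += page.images().size();
            } finally {
                page.clear(); // runs even if extraction of this page throws
            }
        }
        return extracted;
    }

    public static void main(String[] args) {
        List<Page> pages = Arrays.asList(new Page(2), new Page(0), new Page(3));
        System.out.println(extractAll(pages)); // prints 5
    }
}
```

Putting the release in `finally` means a failure on one page does not leave that page's image data pinned in memory for the rest of the document.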

              People

              • Assignee: Tim Allison (tallison)
              • Reporter: Tim Allison (tallison)
              • Votes: 0
              • Watchers: 3
