Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-1471

OOM with corrupt PDF file

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Blocker
    • Resolution: Done
    • 1.6
    • None
    • general
    • None
    • Linux, JVM 1.8.0_25-b17, 64-bit

    Description

      Use of PDFBox 1.8.6 by Tika 1.6 is causing OOM errors with corrupt PDF files, due to a bug in PDFBox, see PDFBOX-2493. This makes Tika 1.6 unusable from inside a long-running webapp and I've had to revert to Tika 1.5. Although 1.5 also throws errors with the corrupt file it does not cause OOM errors.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              alanbur Alan Burlison
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: