Uploaded image for project: 'PDFBox'
  1. PDFBox
  2. PDFBOX-4396

Memory leak due to soft reference caching

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Workaround
    • 2.0.12
    • None
    • None
    • None
    • JDK10; G1

    Description

      In a heap dump, it appears that DefaultResourceCache is retaining 5.3 GB of memory due to buffered images (via PDImageXObject). I suspect that G1 is not collecting soft references across all regions before it out-of-memory errors.

      In PDFBOX-4389, I discovered very slow PDDocument#load times due to a JDK10 I/O bug. Previously I was loading the document to render each page, but this took 1.5 minutes. To work around that bug I reused the document instance across pages. This seems to have fail because the pages were cached and not cleared by the GC.

      The DefaultResourceCache does not prune its cache entries when the soft references are collected. Like WeakHashMap, it should use a ReferenceQueue, poll it on every access, and prune accordingly.

      Thankfully PDDocument#setResourceCache exists. For now I am going to reset the cache to a new instance after a page has been rendered. The entries should no longer be reachable and be GC'd more aggressively. If that doesn't work, I'll either replace the cache (e.g. with Caffeine) or disable it by setting the instance to null.

      I think the desired fix is to prune the DefaultResourceCache and, ideally, reconsider usage of soft references (as they tend to be poor in practice). 

      Attachments

        1. #2 - memory leak.png
          1.29 MB
          Ben Manes
        2. #2 - memory leak 2.png
          1.35 MB
          Ben Manes
        3. memory leak.png
          1.35 MB
          Ben Manes
        4. memory leak 2.png
          1.31 MB
          Ben Manes

        Issue Links

          Activity

            People

              Unassigned Unassigned
              ben.manes Ben Manes
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: