PDFBox / PDFBOX-4392

PDF completely blows up the RAM on Amazon instances

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.0.12
    • Fix Version/s: 2.0.14, 3.0.0 PDFBox
    • Component/s: Rendering
    • Labels:
      None

      Description

      Hi all

      The issue is pretty straightforward. I receive a lot of PDFs every day and render them. In most cases everything is OK, but PDFs that produce the warning

      WARN org.apache.pdfbox.pdmodel.graphics.color.PDICCBased - ICC profile is Perceptual, ignoring, treating as Display class

      take a very long time to render and consume a lot of memory.

      Rendering such a file takes 5 to 15 minutes on an m5.large Amazon instance, but the attached PDF killed the instance completely: Linux simply kills the Java process during processing, with no exception in the logs.

      So could you please explain what is going on with files that trigger the WARN message above, and how I can improve the rendering?

       

      Here are my VM options:

      -Dorg.apache.pdfbox.rendering.UsePureJavaCMYKConversion=true -Xmx3G -Xms2G -Dsun.java2d.cmm=sun.java2d.cmm.kcms.KcmsServiceProvider
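
      When debugging a setup like this, it can help to confirm that the options above actually reached the rendering JVM. The following is only a minimal sketch using plain JDK calls; the class name `VmOptionsCheck` and the `snapshot` helper are hypothetical, and the two property names are simply copied from the flags above. It reports what the running JVM sees and does not by itself explain the memory use.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper (not part of PDFBox): reports the settings from the
// VM options above as seen by the running JVM.
class VmOptionsCheck {

    // Property names copied from the -D flags in the report.
    static final String[] REPORTED_PROPERTIES = {
        "org.apache.pdfbox.rendering.UsePureJavaCMYKConversion",
        "sun.java2d.cmm"
    };

    // Collects the reported system properties plus the heap limit.
    static Map<String, String> snapshot() {
        Map<String, String> values = new LinkedHashMap<>();
        for (String name : REPORTED_PROPERTIES) {
            values.put(name, System.getProperty(name, "<not set>"));
        }
        // maxMemory() reflects the -Xmx limit (it may report slightly less).
        // It covers the Java heap only, not native memory used by the process.
        long maxHeapMiB = Runtime.getRuntime().maxMemory() / (1024L * 1024L);
        values.put("maxHeapMiB", Long.toString(maxHeapMiB));
        return values;
    }

    public static void main(String[] args) {
        snapshot().forEach((key, value) -> System.out.println(key + " = " + value));
    }
}
```

      Running `main` at service start-up and logging the result is one way to rule out a launcher script that silently drops or misquotes a flag.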

      Also, don't hesitate to ask me for more PDFs; I have tons of them.

       

      One more question: does the GPU have any influence on rendering?

        Attachments

        1. 4392-prereadICC.patch
          4 kB
          Itai Shaked
        2. 2f0f8f77-7a85-416d-b5d2-47a07d1416d4_3.pdf
          15.79 MB
          Oleksandr Skoryi

              People

              • Assignee: Tilman Hausherr (tilman)
              • Reporter: Oleksandr Skoryi (AlexFaster)
