Uploaded image for project: 'Flink'
  1. Flink
  2. FLINK-9080

Flink Scheduler goes OOM, suspecting a memory leak

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Won't Fix
    • 1.4.0
    • None
    • Runtime / Coordination
    • None

    Description

      Running FLink version 1.4.0. on mesos,scheduler running along  with job manager in single container, whereas task managers running in seperate containers.

      Couple of jobs were running continously, Flink scheduler was working properlyalong with task managers. Due to some change in data, one of the jobs started failing continuously. In the meantime,there was a surge in  flink scheduler memory usually eventually died out off OOM

       

      Memory dump analysis was done, 

      Following were findings

      •  Majority of top loaded packages retaining heap indicated towards Flinkuserclassloader, glassfish(jersey library), Finalizer classes. (Top level package image)
      • Top level classes were of Flinkuserclassloader, (Top Level class image)
      • The number of classes loaded vs unloaded was quite less  PFA,inspite of adding jvm options of -XX:+UseConcMarkSweepGC -XX:+CMSClassUnloadingEnabled , PFAclassloaded vs unloaded graph, scheduler was restarted 3 times
      • There were custom classes as well which were duplicated during subsequent class uploads

      PFA all the images of heap dump.  Can you suggest some pointers on as to how to overcome this issue.

       

       

      Attachments

        1. Top Level packages.JPG
          86 kB
          Rohit Singh
        2. Top level classes.JPG
          46 kB
          Rohit Singh
        3. Screenshot 2018-12-18 at 12.14.11.png
          176 kB
          Nawaid Shamim
        4. classesloaded vs unloaded.png
          48 kB
          Rohit Singh
        5. class_loader_leak.png
          93 kB
          Michel Davit

        Issue Links

          Activity

            People

              srichter Stefan Richter
              rohsing Rohit Singh
              Votes:
              0 Vote for this issue
              Watchers:
              11 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: