Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-9714

ZooKeeper connection in ZKRMStateStore leaks after RM transitioned to standby

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Reviewed

    Description

      Recently RM full GC happened in one of our clusters, after investigating the dump memory and jstack, I found two places in RM may cause memory leaks after RM transitioned to standby:

      1. Release cache cleanup timer in AbstractYarnScheduler never be canceled.
      2. ZooKeeper connection in ZKRMStateStore never be closed.

      To solve those leaks, we should close the connection or cancel the timer when services are stopping.

      Attachments

        1. YARN-9714.005.patch
          3 kB
          Tao Yang
        2. YARN-9714.004.patch
          3 kB
          Tao Yang
        3. YARN-9714.003.patch
          1 kB
          Tao Yang
        4. YARN-9714.002.patch
          1 kB
          Tao Yang
        5. YARN-9714.001.patch
          4 kB
          Tao Yang

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Tao Yang Tao Yang
            Tao Yang Tao Yang
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment