Hadoop YARN / YARN-7262

Add a hierarchy into the ZKRMStateStore for delegation token znodes to prevent jute buffer overflow


Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.6.0
    • Fix Version/s: 2.9.0, 3.0.0
    • Component/s: None
    • Labels: None
    • Hadoop Flags: Reviewed

    Description

      We've seen users run into a problem where the RM stores so many delegation tokens in the ZKRMStateStore that the listing of those znodes is larger than the jute buffer. This is fine during normal operation, but becomes a problem on a failover because the RM will try to read in all of the token znodes (i.e. call getChildren on the parent znode). This is particularly bad because everything appears to be okay until a failover occurs, at which point you end up with no active RMs.

      There was a similar problem with the YARN application data that was fixed in YARN-2962 by adding a (configurable) hierarchy of znodes so the RM could pull subchildren without overflowing the jute buffer (though it's off by default).
      We should add a hierarchy similar to that of YARN-2962, but for the delegation token znodes.
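
      To illustrate the idea, here is a minimal sketch of how a token znode name could be split into a parent and a child, so that no single getChildren call has to list every token. This is not the code from the attached patches: the class name, the pathFor method, the splitIndex parameter, and the "RMDelegationToken_" prefix are all assumptions made for the example.

```java
/**
 * Hypothetical helper (not from the YARN-7262 patches): maps a delegation
 * token sequence number to a znode path relative to the token root znode.
 */
public final class TokenZnodePaths {
    // Assumed znode name prefix, used here for illustration only.
    private static final String PREFIX = "RMDelegationToken_";

    private TokenZnodePaths() {}

    /**
     * @param sequenceNumber token sequence number, must be >= 0
     * @param splitIndex     number of trailing digits moved into the child
     *                       znode name; 0 keeps the flat layout
     * @return path relative to the token root, e.g. "RMDelegationToken_1234567/89"
     */
    public static String pathFor(int sequenceNumber, int splitIndex) {
        if (sequenceNumber < 0 || splitIndex < 0) {
            throw new IllegalArgumentException("arguments must be non-negative");
        }
        // Zero-pad so at least one digit is always left for the parent znode.
        String digits = String.format("%0" + (splitIndex + 1) + "d", sequenceNumber);
        String name = PREFIX + digits;
        if (splitIndex == 0) {
            return name; // flat layout: every token sits directly under the root
        }
        int cut = name.length() - splitIndex;
        // Parent holds at most 10^splitIndex children.
        return name.substring(0, cut) + "/" + name.substring(cut);
    }
}
```

      With splitIndex set to 2 in this sketch, each parent znode holds at most 100 children and the root lists roughly 1/100th as many entries, so each individual listing is much smaller.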

      Attachments

        1. YARN-7262.003.patch
          53 kB
          Robert Kanter
        2. YARN-7262.002.patch
          51 kB
          Robert Kanter
        3. YARN-7262.001.patch
          47 kB
          Robert Kanter

            People

              Assignee: Robert Kanter (rkanter)
              Reporter: Robert Kanter (rkanter)
              Votes: 0
              Watchers: 7