  1. Hadoop YARN
  2. YARN-7262

Add a hierarchy into the ZKRMStateStore for delegation token znodes to prevent jute buffer overflow

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.6.0
    • Fix Version/s: 2.9.0, 3.0.0
    • Component/s: None
    • Labels: None
    • Target Version/s:
    • Hadoop Flags: Reviewed

      Description

      We've seen users run into a problem where the RM stores so many delegation tokens in the ZKRMStateStore that the listing of those znodes exceeds the jute buffer size. This is fine during normal operation, but becomes a problem on a failover because the RM will try to read in all of the token znodes (i.e. call getChildren on the parent znode). This is particularly bad because everything appears to be okay until a failover occurs, at which point you end up with no active RMs.

      There was a similar problem with the YARN application data, which was fixed in YARN-2962 by adding a (configurable) hierarchy of znodes so the RM could read the children in smaller groups without overflowing the jute buffer (though it's off by default).
      We should add a hierarchy similar to that of YARN-2962, but for the delegation token znodes.
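
      To illustrate the idea, here is a minimal sketch of how a flat token znode name could be split into a parent and a child znode. It is not taken from the attached patches: the class name, the method, the root path, the node name format, and the splitIndex parameter are all hypothetical, chosen only to show why a hierarchy keeps each getChildren listing small.

```java
// Hypothetical helper for illustration only; not the actual ZKRMStateStore code.
final class TokenZnodePath {
    private TokenZnodePath() {}

    /**
     * Builds a znode path for a token node name.
     *
     * The last splitIndex characters of nodeName become a child znode and
     * the remaining prefix becomes its parent. A splitIndex of 0 (or one that
     * would consume the whole name) keeps the original flat layout.
     */
    static String split(String root, String nodeName, int splitIndex) {
        if (splitIndex < 0) {
            throw new IllegalArgumentException("splitIndex must be >= 0");
        }
        if (splitIndex == 0 || splitIndex >= nodeName.length()) {
            // Flat layout: every token znode sits directly under the root.
            return root + "/" + nodeName;
        }
        int cut = nodeName.length() - splitIndex;
        // Hierarchical layout: root/<prefix>/<last splitIndex characters>.
        return root + "/" + nodeName.substring(0, cut) + "/" + nodeName.substring(cut);
    }
}
```

      Under these assumptions, calling TokenZnodePath.split("/tokens", "RMDelegationToken_12345", 2) yields /tokens/RMDelegationToken_123/45. When the trailing characters are decimal digits, a split of 2 caps each parent at 100 children, so the RM can list the parents first and then read each parent's children separately instead of issuing one very large getChildren call.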

        Attachments

        1. YARN-7262.001.patch
          47 kB
          Robert Kanter
        2. YARN-7262.002.patch
          51 kB
          Robert Kanter
        3. YARN-7262.003.patch
          53 kB
          Robert Kanter
        4. YARN-7262.003.patch
          53 kB
          Robert Kanter

              People

              • Assignee: Robert Kanter (rkanter)
              • Reporter: Robert Kanter (rkanter)
              • Votes: 0
              • Watchers: 6