Details

    • Type: Sub-task
    • Status: Patch Available
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.0.0, 2.9.2
    • Fix Version/s: None
    • Component/s: yarn
    • Labels: None

    Description

      Recently, we observed serious event blocking in the RM event dispatcher queue. After analyzing the RM event monitoring data and the RM event processing logic, we found the following:

      1) Environment: a cluster with thousands of nodes.

      2) RMNodeStatusEvent accounts for 90% of the time consumed by the RM event scheduler.

      3) RM event processing runs in a single thread, which leaves the RM event scheduler with little headroom and in turn limits the performance of the RM.

      We therefore propose a multi-threaded RM event processing mechanism to improve RM performance.
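
      As a rough illustration of the idea, the sketch below shows one possible shape for multi-threaded event processing: events are sharded by node ID across several single-threaded workers, so events for the same node are still handled in order. This is an assumption-laden sketch, not the approach in the attached patches or design document; the class ShardedEventDispatcher, its Handler interface, and the string-based events are all hypothetical and do not correspond to YARN's dispatcher classes.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

/** Hypothetical sketch only; not YARN code and not the attached patch. */
final class ShardedEventDispatcher {

    /** Hypothetical callback standing in for a node status event handler. */
    interface Handler {
        void handle(String nodeId, String payload);
    }

    private final List<ExecutorService> workers = new ArrayList<>();
    private final Handler handler;

    ShardedEventDispatcher(int numThreads, Handler handler) {
        if (numThreads < 1) {
            throw new IllegalArgumentException("numThreads must be >= 1");
        }
        this.handler = handler;
        for (int i = 0; i < numThreads; i++) {
            // One single-threaded executor per shard: FIFO order within a shard.
            workers.add(Executors.newSingleThreadExecutor());
        }
    }

    /** The same nodeId always maps to the same shard index. */
    int shardFor(String nodeId) {
        return Math.floorMod(nodeId.hashCode(), workers.size());
    }

    /** Queues the event on the worker that owns this node's shard. */
    void dispatch(String nodeId, String payload) {
        workers.get(shardFor(nodeId)).execute(() -> handler.handle(nodeId, payload));
    }

    /** Stops accepting events and waits for queued ones; false on timeout or interrupt. */
    boolean shutdownAndWait(long timeoutMillis) {
        for (ExecutorService w : workers) {
            w.shutdown();
        }
        long deadline = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(timeoutMillis);
        boolean allDone = true;
        try {
            for (ExecutorService w : workers) {
                long remaining = Math.max(deadline - System.nanoTime(), 0L);
                allDone &= w.awaitTermination(remaining, TimeUnit.NANOSECONDS);
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
        return allDone;
    }
}
```

      Sharding by node ID keeps per-node ordering while letting different nodes be processed in parallel; whether that ordering guarantee is sufficient for the RM's state machines is a question for the actual patch, not something this sketch settles.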

      Attachments

        1. YARN-9927.005.patch
          13 kB
          Qi Zhu
        2. YARN-9927.004.patch
          11 kB
          Qi Zhu
        3. YARN-9927.003.patch
          11 kB
          Qi Zhu
        4. YARN-9927.002.patch
          7 kB
          Qi Zhu
        5. YARN-9927.001.patch
          17 kB
          hcarrot
        6. RM multi-thread event processing mechanism.pdf
          328 kB
          hcarrot

            People

              Assignee: Qi Zhu (zhuqi)
              Reporter: hcarrot
              Votes: 2
              Watchers: 28
