Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-8692

Support node utilization metrics for SLS

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.2.0
    • None
    • None

    Description

      The distribution of node utilization is an important healthy factor for the YARN cluster, related metrics in SLS can be used to evaluate the scheduling effects and optimize related configurations. 

      To implement this improvement, we need to do things as below:

      (1) Add input configurations (contain avg and stddev for cpu/memory utilization ratio) and generate utilization samples for tasks, not include AM container cause I think it's negligible.

      (2) Simulate containers and node utilization within node status. 

      (3) calculate and generate the distribution metrics and use standard deviation metric (stddev for short) to evaluate the effects(smaller is better).  

      (4) show these metrics on SLS simulator page like this:

      For Node memory/CPU utilization distribution graphs, Y-axis is nodes number, and P0 represents 0%~9% utilization ratio(containers-utilization / node-total-resource), P1 represents 10%~19% utilization ratio, P2 represents 20%~29% utilization ratio, ..., at last P9 represents 90%~100% utilization ratio. 

      Attachments

        1. image-2018-08-21-18-04-22-749.png
          105 kB
          Tao Yang
        2. YARN-8692.001.patch
          38 kB
          Tao Yang

        Issue Links

          Activity

            People

              Tao Yang Tao Yang
              Tao Yang Tao Yang
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated: