Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-9509

Capped cpu usage with cgroup strict-resource-usage based on a mulitplier

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • None
    • None
    • nodemanager

    Description

      Add a multiplier configuration on strict resource usage to authorize container to use spare cpu up to a limit.
      Currently with strict resource usage you can't get more than what you request which is sometime not good for jobs that doesn't have a constant usage of cpu (for ex. spark jobs with multiple stages).
      But without strict resource usage we have seen some bad behaviour from our users that don't tune at all their needs and it leads to some containers requesting 2 vcore but constantly using 20.
      The idea here is to still authorize containers to get more cpu than what they request if some are free but also to avoid too big differencies so SLA on jobs is not breached if the cluster is full (at least increase of runtime is contain)

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              nfraison.criteo Nicolas Fraison
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 10m
                  10m