Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-8320

[Umbrella] Support CPU isolation for latency-sensitive (LS) service

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • nodemanager
    • None

    Description

      Currently NodeManager uses “cpu.cfs_period_us”, “cpu.cfs_quota_us” and “cpu.shares” to isolate cpu resource. However,

      • Linux Completely Fair Scheduling (CFS) is a throughput-oriented scheduler; no support for differentiated latency
      • Request latency of services running on container may be frequent shake when all containers share cpus, and latency-sensitive services can not afford in our production environment.

      So we need more fine-grained cpu isolation.

      Here we propose a solution using cgroup cpuset to binds containers to different processors, this is inspired by the isolation technique in Borg system.

      Attachments

        1. YARN-8320.001.patch
          30 kB
          Jiandan Yang
        2. CPU-isolation-for-latency-sensitive-services-v1.pdf
          322 kB
          Jiandan Yang
        3. CPU-isolation-for-latency-sensitive-services-v2.pdf
          318 kB
          Jiandan Yang

        Activity

          People

            Unassigned Unassigned
            yangjiandan Jiandan Yang
            Votes:
            1 Vote for this issue
            Watchers:
            17 Start watching this issue

            Dates

              Created:
              Updated: