  1. Hadoop YARN
  2. YARN-6793

Duplicated reservation in Fair Scheduler preemption

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Duplicate
    • Affects Version/s: 2.8.1, 3.0.0-alpha3
    • Fix Version/s: None
    • Component/s: fairscheduler
    • Labels: None

    Description

      There is a delay between the moment preemption is triggered and the moment the containers are actually killed. If the resources released on a node before the containers are killed are not enough to satisfy the resource request that triggered the preemption, the scheduler makes another reservation on that node.
      For example, during preemption the scheduler reserves <memory 2048, vcore 2> on node 1 for app 1. By default it takes 15s to kill the containers on node 1 in order to fulfill that resource request. If <memory 1024, vcore 1> is released from node 1 before the kill, the scheduler reserves <memory 2048, vcore 2> on node 1 for app 1 again. The second reservation may never be unreserved.
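
      The sequence above can be reproduced with a toy model. This is only a sketch of the scenario, not Fair Scheduler code: Resource, Node, try_assign, simulate and the dedupe flag are hypothetical names invented for illustration, and the dedupe guard is just one conceivable way to avoid the duplicate, not the fix applied in Hadoop.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Resource:
    memory: int
    vcores: int

    def fits_in(self, other):
        return self.memory <= other.memory and self.vcores <= other.vcores


@dataclass
class Node:
    available: Resource
    reservations: list = field(default_factory=list)  # (app, Resource) pairs

    def release(self, freed):
        self.available = Resource(self.available.memory + freed.memory,
                                  self.available.vcores + freed.vcores)


def try_assign(node, app, request, dedupe=False):
    """Allocate if the request fits on the node, otherwise reserve it."""
    key = (app, request)
    if request.fits_in(node.available):
        node.available = Resource(node.available.memory - request.memory,
                                  node.available.vcores - request.vcores)
        # Only the reservation being fulfilled is dropped; a duplicate stays.
        if key in node.reservations:
            node.reservations.remove(key)
        return "allocated"
    if dedupe and key in node.reservations:
        return "already reserved"  # hypothetical guard, assumption only
    node.reservations.append(key)
    return "reserved"


def simulate(dedupe):
    """Replay the scenario; return the number of leftover reservations."""
    node = Node(available=Resource(0, 0))
    request = Resource(2048, 2)
    try_assign(node, "app1", request, dedupe)   # reserved during preemption
    node.release(Resource(1024, 1))             # partial release before the kill
    try_assign(node, "app1", request, dedupe)   # still does not fit
    node.release(Resource(1024, 1))             # preempted containers killed
    try_assign(node, "app1", request, dedupe)   # request finally fits
    return len(node.reservations)


if __name__ == "__main__":
    print("leftover without guard:", simulate(dedupe=False))
    print("leftover with guard:", simulate(dedupe=True))
```

      In this model the unguarded run ends with one reservation that nothing ever clears, which mirrors the "may never be unreserved" behaviour described above.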

            People

              Assignee: Yufei Gu
              Reporter: Yufei Gu