  1. Spark
  2. SPARK-36057 Support volcano/alternative schedulers
  3. SPARK-36060

Support backing off dynamic allocation increases if resources are "stuck"

    Details

    • Type: Sub-task
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.2.0
    • Fix Version/s: None
    • Component/s: Kubernetes
    • Labels:
      None

      Description

      In an over-subscribed environment we may reach a state where our requests for more pods are not going to be fulfilled. Issuing further pod requests will not help and may slow down the scheduler. We should detect this situation and hold off on increasing pod requests until the scheduler allocates more pods to us. We have a limited version of this in the Kube scheduler itself, but it would be better to plumb it all the way through to the dynamic allocation (DA) logic.
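
      One possible shape for the back-off described above, as a minimal sketch only. It is not Spark's allocation code: the class name StuckAwareRequester, its methods, and the 60-second stuck_timeout_s threshold are all hypothetical, chosen for illustration. The idea is to treat requests as "stuck" when pods have been outstanding for longer than a timeout without any new pod being granted, and to refuse increases (but still allow decreases) while in that state.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class StuckAwareRequester:
    """Hypothetical helper: holds pod-request increases while requests look stuck."""

    stuck_timeout_s: float = 60.0           # assumed threshold, not a Spark setting
    requested: int = 0                      # total pods currently requested
    granted: int = 0                        # total pods the scheduler has given us
    waiting_since_s: Optional[float] = None # start of the current no-progress window

    def on_granted(self, total_granted: int, now_s: float) -> None:
        """Record the scheduler's latest grant count."""
        if total_granted > self.granted:
            # Progress was made: restart the clock if pods are still outstanding,
            # otherwise stop it.
            self.waiting_since_s = now_s if total_granted < self.requested else None
        self.granted = total_granted

    def is_stuck(self, now_s: float) -> bool:
        """True when pods have been outstanding with no progress for the timeout."""
        return (
            self.waiting_since_s is not None
            and now_s - self.waiting_since_s >= self.stuck_timeout_s
        )

    def next_request(self, desired: int, now_s: float) -> int:
        """Return the pod total to ask for, given what DA would like."""
        if desired > self.requested and self.is_stuck(now_s):
            # Back off: more requests will not help while we are stuck.
            return self.requested
        self.requested = desired
        if self.requested <= self.granted:
            self.waiting_since_s = None      # nothing outstanding
        elif self.waiting_since_s is None:
            self.waiting_since_s = now_s     # start timing the outstanding pods
        return self.requested
```

      In this sketch an increase is accepted again as soon as on_granted reports at least one newly allocated pod, which matches "until the scheduler allocates more pods to us". A real implementation would also need to decide how this interacts with executor loss and with the existing limit in the Kubernetes backend, which the sketch does not model.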

            People

            • Assignee: Unassigned
            • Reporter: Holden Karau
            • Votes: 0
            • Watchers: 2
