Uploaded image for project: 'Apache YuniKorn'
  1. Apache YuniKorn
  2. YUNIKORN-946

Accounting resources for deleted executor pods

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Critical
    • Resolution: Fixed
    • 0.11
    • 0.12.1
    • core - scheduler
    • None

    Description

      Even when executors are deleted, YK UI shows that resources are consumed by the pod (the one which is already deleted). kubectl get pods  does not show the executor but YK UI shows the information of a deleted pod consuming resources even after few hours. 

      It results into leaking cluster resources.

      Steps:

      1. Run a spark application using k8s spark operator
      2. Wait for executors to be in running state.
      3. Delete the application using `kubectl delete sparkapplications <appName>` OR `kubectl delete {}f <yaml{-}file>`
      4. All the driver and executor pods would be deleted. check `kubectl get pods`
      5. However, YK UI still shows some of the executors running and consuming resources. It leads to leak of the resource as they are considered as used and could not be used by pending pods.

      More details: 

      https://yunikornworkspace.slack.com/archives/CLNUW68MU/p1637126093006900

       

      Attachments

        1. after-yunikorn-776.png
          535 kB
          Anuraag Nalluri
        2. before-yunikorn-776.png
          708 kB
          Anuraag Nalluri
        3. image-2021-11-16-23-17-42-819.png
          394 kB
          Ashutosh Singh
        4. image-2021-11-16-23-18-28-349.png
          171 kB
          Ashutosh Singh

        Issue Links

          Activity

            People

              anuraagn Anuraag Nalluri
              ashutosh-pepper Ashutosh Singh
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: