Uploaded image for project: 'Apache YuniKorn'
  1. Apache YuniKorn
  2. YUNIKORN-553 [Umbrella] Gang scheduling enhancements
  3. YUNIKORN-582

Consider a fallback mechanism to schedule the app in case of gang failure instead of marking the app as failed

    XMLWordPrintableJSON

Details

    Description

      Incases when the app encounters gang issues due to placeholder pod allocation(failed due to various reasons), currently yunikorn marks the app failed.

      Instead, consider a configurable option for hard or soft gang scheduling which allows fallback mechanism to schedule the app successfully. This needs to be brain stormed to see if this makes sense. Let us use this jira for documenting all the thoughts.

      Attachments

        Activity

          People

            kmarton Kinga Marton
            ayubpathan Ayub Pathan
            Votes:
            1 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: