Uploaded image for project: 'Apache YuniKorn'
  1. Apache YuniKorn
  2. YUNIKORN-201

[Umbrella] Application tracking API and CRD - phase 1

Details

    Description

      Today, YK works behind the scene, and the workflow is like

      1. app operator or job server launch a bunch of pods on K8s
      2. YK gets notified and group pods to apps based on appID
      3. YK schedules the pods with respect to the app info

      This provides a simple model to integrate with existing K8s and to support workloads, but it has some user experience issues. Such as

      1. YK can hardly manage the app lifecycle end to end. An outstanding issue is we do not know when an app is finished if we only look at the pod status. 
      2. YK doesn't have ability to admit apps. We need the ability to admit app based on various conditions, e.g resource quota, cluster overhead, ACL, etc. 
      3. Hard to track app status. Sometimes app might be pending in resource queues, but we do not have a good way to expose such status info.

      To further improve the user experience, we need to introduce an application tracking API and K8s custom resource definition (CRD). The CRD will be used by app operator/job server to interact with YK, to get the lifecycle fully controlled.

      Attachments

        Issue Links

          Activity

            People

              kmarton Kinga Marton
              wwei Weiwei Yang
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: