Uploaded image for project: 'Apache YuniKorn'
  1. Apache YuniKorn
  2. YUNIKORN-2665

Gang app originator pod changes after restart

    XMLWordPrintableJSON

Details

    Description

      Gang app choose the first pod (who created the app) as originator pod which becomes the real driver pod later. While processing gang app specifically after the placeholder creation and in the process of replacement, restart can lead to the below described incorrect behaviour:

      During restore, there is no guarantee on the ordering of pods coming from K8s lister especially when all the pods created with the same second timestamp. k8s use the seconds based timestamp, which means all pods created with in same second has same timestamp. During this situation, whichever pod comes first from lister, YK designate it as originator pod. So, any placeholder could become the originator pod and actual originator pod has been lost. This change could cause rippling effects leading to weird behaviour and needs to be fixed.

      Attachments

        Issue Links

          Activity

            People

              mani Manikandan R
              mani Manikandan R
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: