Uploaded image for project: 'Apache YuniKorn'
  1. Apache YuniKorn
  2. YUNIKORN-1187

[Umbrella] Recovery stabilization

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 1.1.0
    • shim - kubernetes
    • None

    Description

      In the past weeks, we discovered numerous problems that can occur during the recovery phase. We need to make that part more reliable, because jobs can get stuck, internal states are not restored properly, etc.

      Attachments

        Issue Links

          There are no Sub-Tasks for this issue.

          Activity

            People

              pbacsko Peter Bacsko
              pbacsko Peter Bacsko
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: