Uploaded image for project: 'Apache YuniKorn'
  1. Apache YuniKorn
  2. YUNIKORN-2322

Investigate YuniKorn stuck when scheduling latency is high

    XMLWordPrintableJSON

Details

    • Task
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • None
    • core - common
    • None

    Description

      We are seeing service stuck when latency increases, even cluster has resource, YuniKorn will not be able to schedule apps. We have to manually restart YuniKorn.

      we did profiling to find out most time are used by tryReservedAllocate. 

      Attached ** profiling screenshot and service latency data.

      Attachments

        Activity

          People

            rainieli Rainie Li
            rainieli Rainie Li
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: