Uploaded image for project: 'Apache YuniKorn'
  1. Apache YuniKorn
  2. YUNIKORN-3 Add scheduling metrics throughout the scheduling cycle
  3. YUNIKORN-647

Add new metrics to monitor pending applications: "long_pending_app"

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Closed
    • Major
    • Resolution: Invalid
    • None
    • None
    • core - common
    • None

    Description

      Based on our observation, if there is one application pending for more than a threshold (e.g. 10 minutes), the scheduler is likely down.

      We would like to capture it for more timely alerting.

      Attachments

        Activity

          People

            chenya_zhang Chenya Zhang
            chenya_zhang Chenya Zhang
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: