Uploaded image for project: 'Aurora'
  1. Aurora
  2. AURORA-117

Scheduler performance issues with very large jobs

    XMLWordPrintableJSON

Details

    • Task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.5.0
    • Reliability, Scheduler

    Description

      The scheduler tends to have performance issues when scheduling very large jobs. We've observed this with jobs exceeding 2000 instances. The TaskScheduler thread tends to consume a large amount of CPU (100%, limited by the global storage lock). Current hypothesis is that the majority of the time is spent satisfying diversity constraints (rack, machine), which require expensive queries.

      Attachments

        Issue Links

          Activity

            People

              wfarner Bill Farner
              wfarner Bill Farner
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: