Uploaded image for project: 'Aurora'
  1. Aurora
  2. AURORA-1240

Ignore JobUpdateSettings.maxWaitToInstanceRunningMs in the scheduler

    Details

    • Type: Task
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.8.0
    • Component/s: Scheduler
    • Labels:
      None

      Description

      The UpdateConfig restart_theshold [1] setting does not appear to deliver much user value as it's highly sensitive to scheduling performance and may result in aborted/rolled back job updates when set too low.

      Some background: This timeout controls task transition from PENDING to RUNNING during the job update. In the event of cluster capacity shortage, assigning a task to a host may take considerably longer thus expiring the timeout and depending on the failure settings causing an unnecessary job update abort or rollback. It was meant to give users some protection against unsatisfiable resource/constraint requirements. In reality though, it proved to be rather an annoyance to users when an update is interrupted due to unexpected delay in task assignment.

      Consider deprecating and subsequently removing this setting.

      This ticket tracks a first step to ignore this value in the scheduler updater. See linked tickets for follow-up work.

      [1] - https://github.com/apache/aurora/blob/master/docs/configuration-reference.md#updateconfig-objects

        Issue Links

          Activity

          Hide
          wfarner Bill Farner added a comment -
          Show
          wfarner Bill Farner added a comment - Scheduler change: https://reviews.apache.org/r/32840/
          Hide
          wfarner Bill Farner added a comment -

          Opting to make no client change for now, so that the 0.8.0 client is compatible with the 0.7.0 scheduler.

          Show
          wfarner Bill Farner added a comment - Opting to make no client change for now, so that the 0.8.0 client is compatible with the 0.7.0 scheduler.

            People

            • Assignee:
              wfarner Bill Farner
              Reporter:
              maximk Maxim Khutornenko
            • Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development