Uploaded image for project: 'Aurora'
  1. Aurora
  2. AURORA-45

Scheduler should wait for registered to be called before attempting to invoke driver

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.5.0
    • Scheduler
    • None

    Description

      We have observed the scheduler attempting to kill tasks before registered() had been called. This resulted in the driver dropping those attempts on the floor. Since the driver didn't signal failure to the scheduler (but instead logged an error) the scheduler wrote a KILLING state transition to the replicated log and signaled success to the client. Since the killTasks message was never sent the task timed out and the task continued to run until the GC executor reconciled state.

      Attachments

        Activity

          People

            wfarner Bill Farner
            wfarner Bill Farner
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: