Aurora / AURORA-1242

Consider new task states to track package fetching and task launching

Details

    • Type: Story
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Executor, Scheduler
    • Labels: None

    Description

      There is currently little to no visibility into task launch activity. Once a task reaches RUNNING, it is considered to have entered the application realm, where the only way to dissect the warmup period is to examine the thermos processes (if they exist). The warmup may take an arbitrarily long time to complete, which makes the visibility problem even worse.

      Another example is the Docker container pull (AURORA-1059), where a task stays in ASSIGNED until the docker pull completes. This skews our SLA metrics and risks the task being aborted for exceeding the transient task timeout.

      We should consider adding task states that explicitly track package/container fetching and launch/warmup activity, e.g.:
      ASSIGNED -> FETCHING -> LAUNCHING|STARTING -> RUNNING

      This would require changing the schema so that package fetching is defined explicitly.
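
      As a rough illustration of the proposed chain, the sketch below models the launch path as a small transition table and derives the time spent in each state from a list of status events. It is not Aurora code: the names TaskState, ALLOWED, is_valid_transition and time_in_states are hypothetical, the timestamps are made up, and LAUNCHING is picked arbitrarily over the alternative name STARTING. Terminal states (failure, kill, loss) are left out for brevity.

```python
from enum import Enum
from typing import Dict, List, Tuple


class TaskState(Enum):
    ASSIGNED = "ASSIGNED"
    FETCHING = "FETCHING"    # proposed: package/container fetch in progress
    LAUNCHING = "LAUNCHING"  # proposed: launch/warmup (the issue also suggests STARTING)
    RUNNING = "RUNNING"


# Forward transitions of the proposed launch path only (assumption: no
# terminal states are modeled in this sketch).
ALLOWED: Dict[TaskState, set] = {
    TaskState.ASSIGNED: {TaskState.FETCHING},
    TaskState.FETCHING: {TaskState.LAUNCHING},
    TaskState.LAUNCHING: {TaskState.RUNNING},
    TaskState.RUNNING: set(),
}


def is_valid_transition(src: TaskState, dst: TaskState) -> bool:
    """Return True if dst directly follows src in the proposed chain."""
    return dst in ALLOWED[src]


def time_in_states(events: List[Tuple[float, TaskState]]) -> Dict[TaskState, float]:
    """Sum the seconds spent in each state that the task has already left.

    events is a time-ordered list of (timestamp_secs, state) pairs.
    """
    durations: Dict[TaskState, float] = {}
    for (t0, state), (t1, next_state) in zip(events, events[1:]):
        if not is_valid_transition(state, next_state):
            raise ValueError(f"illegal transition {state.name} -> {next_state.name}")
        durations[state] = durations.get(state, 0.0) + (t1 - t0)
    return durations


# Hypothetical timeline: a 45 second fetch followed by a 15 second warmup.
events = [
    (0.0, TaskState.ASSIGNED),
    (2.0, TaskState.FETCHING),
    (47.0, TaskState.LAUNCHING),
    (62.0, TaskState.RUNNING),
]
durations = time_in_states(events)
```

      With explicit states, the fetch and warmup periods show up as separate entries in durations instead of being folded into ASSIGNED or RUNNING, which is the visibility this issue asks for.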

            People

              Assignee: Unassigned
              Reporter: Maxim Khutornenko (maximk)
              Votes: 0
              Watchers: 4
