Description
The scheduler's /vars endpoint only exposes a tasks_lost_rack_XXX metric when tasks in a rack have been lost (a tasks_lost_rack_XXX key has a non-zero value). If no tasks in a rack have been lost then metrics for the rack are not exposed. This makes the metrics difficult to use for alerting purposes – it's impossible to tell whether the rack does not exist or exists but has had no lost tasks. Each rack should have an entry in /vars regardless of whether there have been any lost tasks.
Sample metrics:
tasks_lost_rack_aab 3
tasks_lost_rack_aae 4
tasks_lost_rack_aah 2
tasks_lost_rack_aai 3
Expected metrics:
tasks_lost_rack_aaa 0
tasks_lost_rack_aab 3
tasks_lost_rack_aac 0
tasks_lost_rack_aad 0
tasks_lost_rack_aae 4
tasks_lost_rack_aaf 0
tasks_lost_rack_aag 0
tasks_lost_rack_aah 2
tasks_lost_rack_aai 3