Details
-
Bug
-
Status: Resolved
-
Blocker
-
Resolution: Fixed
-
None
Description
If the master fails over and an agent does not re-register within the `agent_reregister_timeout`, the master marks the agent as unreachable in the registry and sends `slaveLost` for it. However, we neglected to update the master's in-memory state for the newly unreachable agent; this meant that task reconciliation would return incorrect results (until/unless the next master failover).