[MESOS-9934] Master does not handle returning unreachable agents as draining/deactivated - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Resolved
Priority: Critical
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.9.0
Component/s: master
Labels:
- foundations

Epic Link:
Agent Draining
Sprint:
Foundations: RI-17 Sprint 52
Story Points:
3

Description

The master has two code paths for handling agent reregistration messages, one culminating in Master::__reregisterSlave and the other in Master::_reregisterSlave. The two paths are not continuations of each other. Looks like we missed the double-underscore case in the initial implementation. This is the path that unreachable agents take, when/if they come back to the cluster. The result is that when unreachable agents are marked for draining, they do not get sent the appropriate message unless they are forced to reregister again (i.e. restarted manually).

Attachments

Activity

People

Assignee:: Joseph Wu

Reporter:: Joseph Wu

Shepherd:: Greg Mann

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 12/Aug/19 18:11

Updated:: 13/Aug/19 23:29

Resolved:: 13/Aug/19 23:29