Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Duplicate
-
None
-
None
-
None
Description
The main idea of mesos/marathon is to sleep well, but after node reboot mesos task gets stuck in staging for about 4 hours.
To reproduce the issue:
- setup a mesos cluster in HA mode with systemd enabled mesos-master and mesos-slave service.
- run docker registry (https://hub.docker.com/_/registry/ ) with mesos constraint (hostname:LIKE:mesos-slave-1) in one node. Reboot the node and notice that task getting stuck in staging.
Possible workaround: service mesos-slave restart fixes the issue.
OS: centos 7.2
mesos version: 0.28.1
marathon: 1.1.1
zookeeper: 3.4.8
docker: 1.9.1 dockerAPIversion: 1.21
error message:
May 30 08:38:24 euca-10-254-237-140 mesos-slave[832]: W0530 08:38:24.120013 909 slave.cpp:2018] Ignoring kill task docker-registry.066fb448-2628-11e6-bedd-d00d0ef81dc3 because the executor 'docker-registry.066fb448-2628-11e6-bedd-d00d0ef81dc3' of framework 8517fcb7-f2d0-47ad-ae02-837570bef929-0000 is terminating/terminated
Attachments
Attachments
Issue Links
- duplicates
-
MESOS-7215 Race condition on re-registration of non-partition-aware frameworks
- Resolved