Details
- Type: Bug
- Status: Resolved
- Priority: Major
- Resolution: Fixed
- Version: 0.28.0
- Environment: CentOS 6.7 running in VirtualBox
- Sprint: Mesosphere Sprint 29
Description
The issue first arose in MESOS-3271, but it can be reproduced every time in the environment listed above by running:
sudo ./bin/mesos-tests.sh --gtest_filter="MemoryPressureMesosTest.CGROUPS_ROOT_SlaveRecovery" --gtest_repeat=1000
The problem can be traced back to libevent_poll.cpp. If the event is triggered and the future associated with the event is discarded at about the same time, pollCallback() can start executing just early enough to finish before pollDiscard() executes. In that case, pollCallback() deletes the poll object, and pollDiscard() is left with a dangling pointer, which crashes when it executes the line event_active(ev, EV_READ, 0);.
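The ordering problem described above can be illustrated with a minimal, single-threaded sketch. It does not use libevent and is not the code from libevent_poll.cpp: FakeEvent, Poll, Loop, and discardTouchedEvent are hypothetical names chosen for illustration, and the queue only simulates tasks scheduled on an event loop. The sketch shows one possible guard, assuming the event is held through a std::shared_ptr so that the discard path can keep a std::weak_ptr and detect that the callback has already destroyed the poll object.

```cpp
#include <cassert>
#include <deque>
#include <functional>
#include <memory>
#include <utility>

// Hypothetical stand-in for libevent's `struct event`; it only counts
// how often it was "activated".
struct FakeEvent
{
  int activations = 0;
};

// Hypothetical stand-in for the poll object that the callback deletes.
struct Poll
{
  std::shared_ptr<FakeEvent> ev = std::make_shared<FakeEvent>();
};

// Simulated event loop: queued tasks run in FIFO order.
using Loop = std::deque<std::function<void()>>;

// Queues a callback task and a discard task in the requested order and
// returns true if the discard task was still able to touch the event.
bool discardTouchedEvent(bool callbackRunsFirst)
{
  Loop loop;
  Poll* poll = new Poll();
  std::weak_ptr<FakeEvent> weak = poll->ev; // Non-owning handle for discard.
  bool touched = false;

  // Mirrors the role of pollCallback(): it destroys the poll object.
  auto callback = [&poll]() {
    delete poll;
    poll = nullptr;
  };

  // Mirrors the role of pollDiscard(): it only activates the event if the
  // event is still alive, instead of dereferencing a raw pointer.
  auto discard = [weak, &touched]() {
    if (std::shared_ptr<FakeEvent> ev = weak.lock()) {
      ev->activations++;
      touched = true;
    }
  };

  if (callbackRunsFirst) {
    loop.push_back(callback);
    loop.push_back(discard);
  } else {
    loop.push_back(discard);
    loop.push_back(callback);
  }

  while (!loop.empty()) {
    std::function<void()> task = std::move(loop.front());
    loop.pop_front();
    task();
  }

  return touched;
}
```

Calling discardTouchedEvent(true) models the problematic ordering: the callback frees the poll object first, weak.lock() then returns an empty pointer, and the discard becomes a no-op rather than a use-after-free. With a raw pointer captured in the discard task, the same ordering would dereference freed memory.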
Issue Links
- blocks: MESOS-3271 SlaveRecoveryTest/0.NonCheckpointingFramework is flaky. (Accepted)