Details
- Type: Story
- Status: Resolved
- Priority: Major
- Resolution: Fixed
None
Description
When a replica joins Paxos with an empty log, we do not allow it to participate fully in Paxos right away. Instead, the replica is treated as a non-voting member, meaning that it does not reply to any requests from other replicas. It only learns the log entries that have already been agreed on and tries to catch up with the leader. Once the catch-up process is done, we re-admit the replica to Paxos and allow it to vote.
If a disk fails and we want to swap out a master machine, we can simply start the scheduler on a new machine with an empty log; no other steps are needed.
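The behaviour described above can be sketched roughly as follows. This is a minimal illustration, not the Mesos implementation: the names `Replica`, `Status`, `handle_request`, `learn`, and `catch_up` are hypothetical, and proposal numbers, networking, and the actual Paxos phases are left out. It only shows the state change from non-voting to voting.

```python
from dataclasses import dataclass, field
from enum import Enum


class Status(Enum):
    # Hypothetical status names, chosen for this sketch only.
    NON_VOTING = "non-voting"  # joined with an empty log, still catching up
    VOTING = "voting"          # full Paxos participant


@dataclass
class Replica:
    log: dict = field(default_factory=dict)  # log position -> agreed value
    status: Status = Status.NON_VOTING       # a new, empty replica cannot vote

    def handle_request(self, position, value):
        """Answer a request from another replica, or stay silent."""
        if self.status is not Status.VOTING:
            return None  # non-voting members do not reply at all
        # Simplification: a voting member just accepts the value here.
        self.log[position] = value
        return ("ack", position)

    def learn(self, position, value):
        """Record an entry that the voting members have already agreed on."""
        self.log[position] = value

    def catch_up(self, agreed_log):
        """Learn every agreed entry, then rejoin as a voting member."""
        for position, value in sorted(agreed_log.items()):
            self.learn(position, value)
        self.status = Status.VOTING
```

In this sketch, a replacement machine would start as `Replica()` with an empty log, ignore requests while it is non-voting, and begin acknowledging them only after `catch_up` has copied the agreed entries.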
Issue Links
- blocks
  - MESOS-962 A developer guide on the catch-up log for the Mesos developer community (Open)
  - MESOS-961 A blog post on the catch-up log for the Mesos user community (Resolved)
- depends upon
  - MESOS-807 Discard is not propagated in process::dispatch. (Resolved)
  - MESOS-806 Allowing converting from an Owned<T> to a Shared<T>. (Resolved)
- is depended upon by
  - MESOS-770 Rate control and randomization of Replicated Log catching-up (Accepted)
- is related to
  - MESOS-993 Performance issue during replicated log catch-up when the initial log position is large (Resolved)
- relates to
  - MESOS-683 Support reconfigurable replicated log (Open)