Details
Type: Improvement
Status: Open
Priority: Low
Resolution: Unresolved
Description
My idea is to augment the gossip protocol to carry timestamps. We wouldn't use the timestamps for anything "important", but they would let each node expose a single number: the age, in milliseconds (or seconds), of its stalest information about any node that is still alive.
When a node goes down you'd see spikes, but in the common case where nodes stay up, this number should give you a pretty good idea of how fast gossip information is propagating through the cluster, assuming you keep your clocks in sync.
It should be a good metric to graph and to alert on.
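To make the proposed number concrete, here is a minimal sketch of how a node might compute it. It assumes each node keeps a map from peer name to the timestamp (in milliseconds) attached to the most recent gossip state it holds for that peer, plus a set of peers it currently believes are alive. The function name, the map, and the set are hypothetical and do not correspond to any existing gossip API.

```python
import time
from typing import Dict, Optional, Set


def oldest_live_gossip_age_ms(
    last_update_ms: Dict[str, int],
    live_nodes: Set[str],
    now_ms: Optional[int] = None,
) -> int:
    """Return the age, in ms, of the stalest information held about any live node.

    last_update_ms: hypothetical map of peer -> timestamp carried by the latest
                    gossip state received for that peer.
    live_nodes:     hypothetical set of peers currently considered alive.
    """
    if now_ms is None:
        now_ms = int(time.time() * 1000)

    # Only live peers count; a dead peer's state is expected to go stale.
    ages = [now_ms - ts for node, ts in last_update_ms.items() if node in live_nodes]

    # Clamp at zero: with skewed clocks a remote timestamp can appear to be
    # in the future, which would otherwise produce a negative age.
    return max(0, max(ages, default=0))


if __name__ == "__main__":
    state = {"node-a": 1_000, "node-b": 4_000, "node-c": 100}
    print(oldest_live_gossip_age_ms(state, {"node-a", "node-b"}, now_ms=5_000))
```

Restricting the maximum to live peers is what keeps the value meaningful in steady state. A peer that has just died but is not yet marked down would still be counted, which is where the spikes mentioned above would come from.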