[KUDU-798] Proper safe time advancement by leaders and replicas - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Resolved
Priority: Critical
Resolution: Fixed
Affects Version/s: M5
Fix Version/s: 1.2.0
Component/s: consensus, tserver
Labels:
None

Target Version/s:

1.2.0

Description

Safe time is the time after which replicas can serve snapshot reads.

Leaders should only advance the safe time as they commit an operation in their own term (the initial safe time should be the time of that initial operation) and replicas should only advance the safe time based on what the leader says.

If this is not the case then replicas can write to replica A (leader), get back 10 as a write timestamp and read from replica B which still hasn't replicated it and not see its own write. This can even happen while reading from just leaders if B was elected but hadn't yet applied all the previous writes locally.

The attached log provides an example of the latter case

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

log.txt
29/May/15 19:02
411 kB
David Alves

Issue Links

blocks

KUDU-1056 Make ksck support safe time and rework ksck snapshot tests

Resolved

is blocked by

KUDU-1042 Consider serializing timestamp assignment for non-tablet operations

Resolved

Activity

People

Assignee:: David Alves

Reporter:: David Alves

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Dates

Created:: 29/May/15 19:02

Updated:: 08/Dec/16 16:24

Resolved:: 08/Dec/16 16:24