[IGNITE-17738] Cluster must be able to fix the partition inconsistency on restart/node_join by itself - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Sub-task
Status: Resolved
Priority: Major
Resolution: Won't Fix
Affects Version/s: None
Fix Version/s: None
Component/s: None
Labels:
- iep-31
- ise

Ignite Flags:

Docs Required, Release Notes Required

Description

On cluster restart (because of power-off, OOM or some other problem) it's possible to have PDS inconsistent (primary partitions may contain operations missed on backups as well as counters may contain gaps even on primary).

1) Currently, "historical rebalance" is able to sync the data to the highest LWM for every partition.
Most likely, a primary will be chosen as a rebalance source, but the data after the LWM will not be rebalanced. So, all updates between LWM and HWM will not be synchronized.
See PartialHistoricalRebalanceTest.java

2) In case LWM is the same on primary and backup, rebalance will be skipped for such partition.
See SkippedRebalanceBecauseOfTheSameLwmTest.java

Proposals:

1) Cheap fix
A possible solution is for the case when the cluster failed and restarted (same baseline) is to fix the counters automatically (when cluster composition is equal to the baseline specified before the crash).

Counters should be set as

HWM at primary and as LWM at backups for caches with 2+ backups,
LWM at primary and as HWM at backups for caches with a single backup.

2) Complex fix (when baseline changed)
Rebalance must honor whole counter state (LWM, HWM, gaps).
2.0) Primary HWM must be set to the highest HWM across the copies to avoid reapplying of already applied update counters on backups.
2.1) In case when WAL is available all entries between LWM and HWM (including) must be rebalanced to other nodes where they are required.
Even from backups to the primary.
Such approach may require rebalance as a prerequisite to activation finish.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

PartialHistoricalRebalanceTest.java
14/Oct/22 15:23
11 kB
Anton Vinogradov
SkippedRebalanceBecauseOfTheSameLwmTest.java
14/Oct/22 15:49
10 kB
Anton Vinogradov

Issue Links

is related to

IGNITE-17949 Full rebalance must be restricted when it causes any updates loss.

Open

relates to

IGNITE-11611 If partition consistency cannot be restored during rebalance using counters the most recent partition data should be used.

Open

links to

GitHub Pull Request #10420

Activity

People

Assignee:: Maxim Muzafarov

Reporter:: Anton Vinogradov

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Dates

Created:: 21/Sep/22 17:36

Updated:: 06/Dec/22 14:19

Resolved:: 06/Dec/22 14:19

Time Tracking

Estimated:

Not Specified

Remaining:

Logged:

0.5h