[SOLR-8367] The new LIR 'all replicas participate' failsafe code needs to be improved. - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 5.5, 6.0
Component/s: None
Labels:
None

Description

For one, it currently only kicks in the first attempted leader. If it's another replica that is stuck in LIR, it won't help.

Second, when we attempt to be leader, knowing we might fail due to LIR, we should not put other replicas into recovery if they fail to sync with us - not until we know we will actually be leader.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

SOLR-8367.patch
08/Dec/15 15:21
6 kB
Mark Miller
SOLR-8367.patch
08/Dec/15 22:42
16 kB
Mark Miller
SOLR-8367.patch
09/Dec/15 15:00
17 kB
Mark Miller

Issue Links

relates to

SOLR-8075 Leader Initiated Recovery should not stop a leader that participated in an election with all of it's replicas from becoming a valid leader.

Closed

SOLR-8279 Add a new test fault injection approach and a new SolrCloud test that stops and starts the cluster while indexing data and with random faults.

Closed

Activity

People

Assignee:: Mark Miller

Reporter:: Mark Miller

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 04/Dec/15 00:49

Updated:: 09/May/16 18:58

Resolved:: 10/Dec/15 16:22