Solr / SOLR-8034

If minRF is not satisfied, leader should not put replicas in recovery

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 5.4, 6.0
    • Component/s: SolrCloud
    • Flags: Patch

      Description

      If the minimum replication factor parameter (minRf) in a Solr update request is not satisfied – i.e. if the update was not successfully applied on at least n replicas where n >= minRf – the shard leader should not put the failed replicas into "leader initiated recovery"; the client should retry the update instead.

      This is so that, in the scenario where minRf is not satisfied, the failed replicas remain eligible to become the leader in case of leader failure, since from the client's perspective the update did not succeed.

      This came up from a network partition scenario where the leader became sectioned off from its two followers, but all of them could still talk to ZooKeeper. The partitioned leader put its two followers into leader initiated recovery, so we couldn't just kill off the partitioned node and have a follower take over leadership. For the minRf=1 case this is the correct behavior, because the partitioned leader would have accepted updates that the followers don't have, and therefore we can't switch leadership or we'd lose those updates. However, in the minRf=2 case, Solr never accepted any update from the client's point of view, so the partitioned leader doesn't have any accepted update that the followers lack, and therefore the followers should be eligible to become leaders. Thus, I'm proposing modifying the leader initiated recovery logic to not put the followers in recovery if the minRf parameter is present and is not satisfied.
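      To make the client-side contract concrete, the following is a minimal SolrJ sketch of an update sent with a minimum replication factor and retried when it is not achieved. It is illustrative only: the "min_rf" request parameter and the getMinAchievedReplicationFactor helper are used as generally understood for this era of Solr and are not taken from this patch, and the collection name, ZooKeeper address, and retry policy are assumptions.

        import org.apache.solr.client.solrj.impl.CloudSolrClient;
        import org.apache.solr.client.solrj.request.UpdateRequest;
        import org.apache.solr.client.solrj.response.UpdateResponse;
        import org.apache.solr.common.SolrInputDocument;

        public class MinRfRetryExample {
          public static void main(String[] args) throws Exception {
            final int minRf = 2;                              // require the update on at least 2 replicas
            final String collection = "test_collection";      // hypothetical collection name
            try (CloudSolrClient client = new CloudSolrClient("localhost:2181")) {  // hypothetical ZK address
              client.setDefaultCollection(collection);

              SolrInputDocument doc = new SolrInputDocument();
              doc.addField("id", "1");

              UpdateRequest req = new UpdateRequest();
              req.add(doc);
              req.setParam("min_rf", String.valueOf(minRf));  // ask Solr to report the achieved replication factor

              for (int attempt = 0; attempt < 3; attempt++) {
                UpdateResponse rsp = req.process(client);
                // Minimum replication factor achieved across the shards this update touched.
                int achievedRf = client.getMinAchievedReplicationFactor(collection, rsp.getResponse());
                if (achievedRf >= minRf) {
                  return;                                     // enough replicas applied the update
                }
                Thread.sleep(1000);                           // back off, then retry the same update
              }
              throw new RuntimeException("min_rf=" + minRf + " was not achieved after retries");
            }
          }
        }

      With the change proposed here, a follower that misses such an update is no longer put into leader initiated recovery when minRf is not achieved, so it stays eligible for leader election and a retry like the one above can still succeed after leadership moves.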

        Attachments

      1. SOLR-8034.patch
        9 kB
        Jessica Cheng Mallet
      2. SOLR-8034.patch
        9 kB
        Jessica Cheng Mallet


          Activity

          Jessica Cheng Mallet added a comment -

          Tim Potter This is what we discussed a while ago. Will you please give it a look? Thanks!

          Anshum Gupta added a comment - edited

          I'm kind of split on this, as the replica here would be out of sync with the leader and would never know about it, increasing the odds of inconsistency when the client doesn't handle it the right way; i.e. it kind of self-heals at this point, and that would stop happening.

          At the same time, I kind of like the idea as it allows for failover in the case you've mentioned above.

          For the patch, LGTM. Can you also test that the replica is down, right after the first network partition, but before you send the document? That would ensure that the replication factor is 2, because of the partition and not due to another variable.

          Also, can you fix the assertion-failure message below? You're expecting 2 non-leader replicas, not 2 replicas:

          +    List<Replica> notLeaders =
          +        ensureAllReplicasAreActive(testCollectionName, "shard1", 1, 3, maxWaitSecsToSeeAllActive);
          +    assertTrue("Expected 2 replicas for collection " + testCollectionName
          +            + " but found " + notLeaders.size() + "; clusterState: "
          +            + printClusterStateInfo(testCollectionName),
          +        notLeaders.size() == 2);
          
          Tim Potter added a comment -

          Hi Jessica Cheng Mallet. I don't recall talking with you - perhaps you meant someone else?

          Jessica Cheng Mallet added a comment -

          Anshum Gupta, I fixed the comment for the assertion, but I didn't add the test that the replica is down after the first network partition, because the point is that the replica will not realize on its own that it's down, since the partition is between the leader and the replica, not between the replica and ZooKeeper – so it won't be set to "down" until the leader tries to forward the document to it, fails, and then puts it in leader-initiated recovery.

          Tim Potter, we discussed this in ticket 4072.

          Jessica Cheng Mallet added a comment -

          Ah, and regarding

          I'm kind of split on this as the replica here would be out of sync from the leader and would never know about it, increasing the odds of inconsistency when the client doesn't handle it the right way i.e. it kind of self-heals at this point, and that would stop happening.

          I'd hope that if the user is explicitly using minRf, they handle it the right way (i.e. retry if minRf isn't achieved). The contract would be that if the request fails, it needs to be retried, or we can possibly see inconsistent state. I think this is already true today for a normal update: if the forwarded parallel update to the followers succeeds but it somehow fails on the leader, a failure would be returned to the user even though the update could be present on the followers.

          Timothy Potter added a comment -

          Tim Potter oops! Jessica was actually pinging me, thelabdude (same name, different handle).

          Jessica Cheng Mallet this looks good to me ... nice test case! Also, I agree that if the client is using minRf, then it is their responsibility to handle the response correctly. Previously, we talked about throwing an exception instead of just returning the value for the client to interpret; maybe that makes it more explicit that clients MUST handle minRf not being achieved. We should handle that in another ticket, though.

          Anshum Gupta added a comment -

          Thanks for fixing the assert.

          replica will not realize it's down on its own since the partition is between the leader and the replica, not between the replica and zookeeper – so it won't be set to down until the leader tries to forward the document to it and fails

          Right, should've realized that.

          Also, about my opinion being split: I wasn't completely sold on this at first, but I thought about it more and it makes more sense to go with this.

          Thanks Jessica Cheng Mallet. LGTM overall, I'll commit this.

          Jessica Cheng Mallet added a comment -

          Oops, sorry! Didn't know there's another Tim Potter.

          ASF subversion and git services added a comment -

          Commit 1703289 from Anshum Gupta in branch 'dev/trunk'
          [ https://svn.apache.org/r1703289 ]

          SOLR-8034: Leader no longer puts replicas in recovery in case of a failed update, when minRF isn't achieved.

          ASF subversion and git services added a comment -

          Commit 1703302 from Anshum Gupta in branch 'dev/branches/branch_5x'
          [ https://svn.apache.org/r1703302 ]

          SOLR-8034: Leader no longer puts replicas in recovery in case of a failed update, when minRF isn't achieved. (merge from trunk)

          Mark Miller added a comment -

          Previously, we talked about throwing an exception instead of just returning the value for the client to interpret; maybe that makes it more explicit that clients MUST handle minRf not being achieved.

          I still think that is how this should work.

          Timothy Potter added a comment -

          Cool - I created SOLR-8062 to handle that change.

          Anshum Gupta added a comment -

          Thanks Jessica Cheng Mallet.

            People

            • Assignee: Anshum Gupta
            • Reporter: Jessica Cheng Mallet
            • Votes: 0
            • Watchers: 5
