SOLR-7819

ZkController.ensureReplicaInLeaderInitiatedRecovery does not respect retryOnConnLoss

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 5.2, 5.2.1
    • Fix Version/s: 5.4, 6.0
    • Component/s: SolrCloud
    • Labels:

      Description

      SOLR-7245 added a retryOnConnLoss parameter to ZkController.ensureReplicaInLeaderInitiatedRecovery so that indexing threads do not hang on ZK operations during a partition. However, some of those changes were unintentionally reverted by SOLR-7336 in 5.2.

      I found this while running Jepsen tests on 5.2.1 where a hung update managed to put a leader into a 'down' state (I'm still investigating and will open a separate issue about this problem).
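
      For context, the intent of the retryOnConnLoss flag is roughly the following. This is a minimal, hedged sketch against a bare ZooKeeper client; ZkWriteHelper, maxRetries and retrySleepMs are illustrative assumptions and not Solr's SolrZkClient API. A caller that passes retryOnConnLoss=false should fail fast on connection loss instead of blocking.

          import org.apache.zookeeper.KeeperException;
          import org.apache.zookeeper.ZooKeeper;
          import org.apache.zookeeper.data.Stat;

          final class ZkWriteHelper {
            private final ZooKeeper zk;
            private final int maxRetries;     // assumed bound; Solr derives its own from the client timeout
            private final long retrySleepMs;  // assumed back-off between retries

            ZkWriteHelper(ZooKeeper zk, int maxRetries, long retrySleepMs) {
              this.zk = zk;
              this.maxRetries = maxRetries;
              this.retrySleepMs = retrySleepMs;
            }

            /** Writes data at path; retries through connection loss only when retryOnConnLoss is true. */
            Stat setData(String path, byte[] data, boolean retryOnConnLoss)
                throws KeeperException, InterruptedException {
              int attempt = 0;
              while (true) {
                try {
                  return zk.setData(path, data, -1); // -1 = any version
                } catch (KeeperException.ConnectionLossException e) {
                  if (!retryOnConnLoss || ++attempt >= maxRetries) {
                    throw e; // fail fast so a calling indexing thread does not hang
                  }
                  Thread.sleep(retrySleepMs);
                }
              }
            }
          }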

      1. SOLR-7819.patch
        43 kB
        Shalin Shekhar Mangar
      2. SOLR-7819.patch
        44 kB
        Shalin Shekhar Mangar
      3. SOLR-7819.patch
        30 kB
        Shalin Shekhar Mangar
      4. SOLR-7819.patch
        30 kB
        Shalin Shekhar Mangar
      5. SOLR-7819.patch
        30 kB
        Shalin Shekhar Mangar
      6. SOLR-7819.patch
        9 kB
        Shalin Shekhar Mangar

        Issue Links

          Activity

          Shalin Shekhar Mangar added a comment -

          Ramkumar Aiyengar - It looks like the commits for SOLR-7245 only added a retryOnConnLoss parameter but it was never used inside the ZkController.updateLeaderInitiatedRecoveryState method?

          Also, now that I think about this change, is it really safe? For example, if a leader was not able to write to a 'live' replica, and during the LIR process the leader couldn't complete a ZK operation (because retryOnConnLoss=false), then LIR won't be set and updates can be missed. Also, the code as currently written bails on a ConnectionLossException and doesn't even start a LIR thread, which is bad.

          I think not having a thread wait for LIR-related activity is a noble cause, but we should move the entire LIR logic to a background thread which must retry on connection loss until it either succeeds or a session expired exception is thrown.

          Thoughts?
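
          A minimal sketch of the background-thread idea above, with hypothetical names; publishLirDownState() stands in for the real LIR znode write and is not the actual Solr code.

              import org.apache.zookeeper.KeeperException;

              // Sketch only: publishLirDownState() is an assumed stand-in for the real LIR znode write.
              abstract class LirPublisher implements Runnable {

                /** Writes the leader-initiated-recovery 'down' marker for one replica. */
                protected abstract void publishLirDownState() throws KeeperException, InterruptedException;

                @Override
                public void run() {
                  while (!Thread.currentThread().isInterrupted()) {
                    try {
                      publishLirDownState();
                      return; // success
                    } catch (KeeperException.ConnectionLossException e) {
                      try {
                        Thread.sleep(1000L); // transient blip: back off and retry
                      } catch (InterruptedException ie) {
                        Thread.currentThread().interrupt();
                        return;
                      }
                    } catch (KeeperException.SessionExpiredException e) {
                      return; // session gone: a new leader election supersedes this attempt
                    } catch (KeeperException | InterruptedException e) {
                      return; // any other failure: give up rather than loop forever
                    }
                  }
                }
              }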

          Ramkumar Aiyengar added a comment -

          Duh, this is why we need a good test for this (I gave up after trying a bit in the original ticket), and I need to pay more attention to automated merges. It looks like my initial patch had the change, but when I merged with your changes for SOLR-7109, the use of the local variable got removed.

          I get your concern. I think we already do this; look at DistributedUpdateProcessor.java around line 883: if we are unable to set the LIR node, we start a thread to keep retrying to set it. We just need to return false in the connection loss case as well; we currently do it only if the node is not live (and hence we didn't even bother setting the node).

          Shalin Shekhar Mangar added a comment -

          I think we already do this; look at DistributedUpdateProcessor.java around line 883: if we are unable to set the LIR node, we start a thread to keep retrying to set it.

          Umm, it looks like the reverse to me. If we are unable to set the LIR node, or if there is an exception, then sendRecoveryCommand=false and we do not create the LeaderInitiatedRecoveryThread at all?

          Shalin Shekhar Mangar added a comment -

          Here's a patch which:

          1. Adds retryOnConnLoss to ZkController's ensureReplicaInLeaderInitiatedRecovery, updateLeaderInitiatedRecoveryState and markShardAsDownIfLeader methods.
          2. Starts a LIR thread if the leader cannot mark the replica as down because of connection loss. Earlier, both a session loss and a connection loss would skip starting the LIR thread.

          I'm still running Solr's integration and jepsen tests.

          This causes a subtle change in behavior which is best analyzed with two different scenarios:

          1. Leader fails to send an update to a replica but also suffers a temporary blip in its ZK connection during the DistributedUpdateProcessor's doFinish method.
            1. Currently, a few indexing threads will hang but eventually succeed in marking the replica as 'down', and the leader will start a new LIR thread to ask the replica to recover.
            2. With this patch, the indexing threads do not hang but a connection loss exception is thrown. At this point, we start a new LIR thread to ask the replica to recover. Although this removes the safety of explicitly marking the replica as 'down', the LIR thread does provide timeout-based safety by making sure that the replica recovers from the leader.
          2. Leader fails to send an update to a replica but also suffers a long network partition between itself and the ZK server during the DUP.doFinish method.
            1. Currently, a few indexing threads will hang in ZkController.ensureReplicaInLeaderInitiatedRecovery until the ZK operations time out because of connection loss or session loss, and no LIR thread will be created. This seems okay because the current connection loss timeout is higher than the ZK session expiration time, and session loss means that ZK has determined that our session has expired already. In both cases, a new leader election should have happened and there's no need to put the replica 'down'.
            2. With this patch, the difference is that the indexing threads do not hang and ensureReplicaInLeaderInitiatedRecovery returns immediately with a connection loss exception. A new LIR thread is started in this scenario. This is also fine because we were not able to mark the replica as 'down' and we aren't sure that the session has expired, so it is important that we start the LIR thread to ask the replica to recover. Even if a new leader has been elected, there's no major harm in asking the replica to recover.

          So, net-net this patch doesn't seem to introduce any new problems in the system.
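
          The net effect can be summarized in a small decision sketch; markReplicaDown(...) and startLirThread(...) are hypothetical stand-ins, not the actual ZkController / DistributedUpdateProcessor calls. Connection loss no longer suppresses the recovery request, while session expiry still does.

              import org.apache.zookeeper.KeeperException;

              // Sketch only; helper methods below are assumed, not Solr API.
              abstract class RecoveryDecision {

                protected abstract void markReplicaDown(String coreNodeName, boolean retryOnConnLoss)
                    throws KeeperException, InterruptedException;

                protected abstract void startLirThread(String coreNodeName);

                void onFailedUpdate(String coreNodeName) throws InterruptedException {
                  boolean sendRecoveryCommand = true;
                  try {
                    markReplicaDown(coreNodeName, false); // fail fast; never hang the indexing thread
                  } catch (KeeperException.SessionExpiredException e) {
                    sendRecoveryCommand = false; // a new leader election supersedes this attempt
                  } catch (KeeperException.ConnectionLossException e) {
                    // keep sendRecoveryCommand = true: the LIR thread's timeout loop is the safety net
                  } catch (KeeperException e) {
                    sendRecoveryCommand = false;
                  }
                  if (sendRecoveryCommand) {
                    startLirThread(coreNodeName);
                  }
                }
              }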

          Shalin Shekhar Mangar added a comment -

          Hmm, this last patch isn't quite right because it can create multiple LIR threads for the same replica on connection loss.

          For example, I found the following in the logs of one of the nodes. Here, 4 LIR threads were created to ask the same replica to recover:

          2015-07-29 13:21:24.629 INFO  (updateExecutor-2-thread-18-processing-x:jepsen5x3_shard2_replica2 r:core_node1 http:////n1:8983//solr//jepsen5x3_shard2_replica1// n:n5:8983_solr s:shard2 c:jepsen5x3) [c:jepsen5x3 s:shard2 r:core_node1 x:jepsen5x3_shard2_replica2] o.a.s.c.LeaderInitiatedRecoveryThread LeaderInitiatedRecoveryThread-jepsen5x3_shard2_replica1 completed successfully after running for 0 secs
          2015-07-29 13:21:24.978 INFO  (updateExecutor-2-thread-19-processing-x:jepsen5x3_shard2_replica2 r:core_node1 http:////n1:8983//solr//jepsen5x3_shard2_replica1// n:n5:8983_solr s:shard2 c:jepsen5x3) [c:jepsen5x3 s:shard2 r:core_node1 x:jepsen5x3_shard2_replica2] o.a.s.c.c.ZkStateReader Updating data for jepsen5x3 to ver 95
          2015-07-29 13:21:24.978 WARN  (updateExecutor-2-thread-19-processing-x:jepsen5x3_shard2_replica2 r:core_node1 http:////n1:8983//solr//jepsen5x3_shard2_replica1// n:n5:8983_solr s:shard2 c:jepsen5x3) [c:jepsen5x3 s:shard2 r:core_node1 x:jepsen5x3_shard2_replica2] o.a.s.c.LeaderInitiatedRecoveryThread Stop trying to send recovery command to downed replica core=jepsen5x3_shard2_replica1,coreNodeName=core_node2 on n1:8983_solr because core_node1 is no longer the leader! New leader is core_node2
          2015-07-29 13:21:24.978 INFO  (updateExecutor-2-thread-19-processing-x:jepsen5x3_shard2_replica2 r:core_node1 http:////n1:8983//solr//jepsen5x3_shard2_replica1// n:n5:8983_solr s:shard2 c:jepsen5x3) [c:jepsen5x3 s:shard2 r:core_node1 x:jepsen5x3_shard2_replica2] o.a.s.c.LeaderInitiatedRecoveryThread LeaderInitiatedRecoveryThread-jepsen5x3_shard2_replica1 completed successfully after running for 39 secs
          2015-07-29 13:21:24.979 INFO  (updateExecutor-2-thread-21-processing-x:jepsen5x3_shard2_replica2 r:core_node1 http:////n1:8983//solr//jepsen5x3_shard2_replica1// n:n5:8983_solr s:shard2 c:jepsen5x3) [c:jepsen5x3 s:shard2 r:core_node1 x:jepsen5x3_shard2_replica2] o.a.s.c.c.ZkStateReader Updating data for jepsen5x3 to ver 95
          2015-07-29 13:21:24.979 WARN  (updateExecutor-2-thread-21-processing-x:jepsen5x3_shard2_replica2 r:core_node1 http:////n1:8983//solr//jepsen5x3_shard2_replica1// n:n5:8983_solr s:shard2 c:jepsen5x3) [c:jepsen5x3 s:shard2 r:core_node1 x:jepsen5x3_shard2_replica2] o.a.s.c.LeaderInitiatedRecoveryThread Stop trying to send recovery command to downed replica core=jepsen5x3_shard2_replica1,coreNodeName=core_node2 on n1:8983_solr because core_node1 is no longer the leader! New leader is core_node2
          2015-07-29 13:21:24.979 INFO  (updateExecutor-2-thread-21-processing-x:jepsen5x3_shard2_replica2 r:core_node1 http:////n1:8983//solr//jepsen5x3_shard2_replica1// n:n5:8983_solr s:shard2 c:jepsen5x3) [c:jepsen5x3 s:shard2 r:core_node1 x:jepsen5x3_shard2_replica2] o.a.s.c.LeaderInitiatedRecoveryThread LeaderInitiatedRecoveryThread-jepsen5x3_shard2_replica1 completed successfully after running for 28 secs
          2015-07-29 13:21:24.981 INFO  (updateExecutor-2-thread-22-processing-x:jepsen5x3_shard2_replica2 r:core_node1 http:////n1:8983//solr//jepsen5x3_shard2_replica1// n:n5:8983_solr s:shard2 c:jepsen5x3) [c:jepsen5x3 s:shard2 r:core_node1 x:jepsen5x3_shard2_replica2] o.a.s.c.c.ZkStateReader Updating data for jepsen5x3 to ver 95
          2015-07-29 13:21:24.981 WARN  (updateExecutor-2-thread-22-processing-x:jepsen5x3_shard2_replica2 r:core_node1 http:////n1:8983//solr//jepsen5x3_shard2_replica1// n:n5:8983_solr s:shard2 c:jepsen5x3) [c:jepsen5x3 s:shard2 r:core_node1 x:jepsen5x3_shard2_replica2] o.a.s.c.LeaderInitiatedRecoveryThread Stop trying to send recovery command to downed replica core=jepsen5x3_shard2_replica1,coreNodeName=core_node2 on n1:8983_solr because core_node1 is no longer the leader! New leader is core_node2
          2015-07-29 13:21:24.981 INFO  (updateExecutor-2-thread-22-processing-x:jepsen5x3_shard2_replica2 r:core_node1 http:////n1:8983//solr//jepsen5x3_shard2_replica1// n:n5:8983_solr s:shard2 c:jepsen5x3) [c:jepsen5x3 s:shard2 r:core_node1 x:jepsen5x3_shard2_replica2] o.a.s.c.LeaderInitiatedRecoveryThread LeaderInitiatedRecoveryThread-jepsen5x3_shard2_replica1 completed successfully after running for 33 secs
          
          Shalin Shekhar Mangar added a comment -

          This patch moves all LIR related activity inside the LIR thread. The LIR thread now publishes LIR state, publishes node state and then starts a recovery loop depending on whether LIR state was published successfully or if it failed because of session expiry or connection loss. The indexing thread only consults the local replica map to ensure that only 1 LIR thread is started for any given replica. This ensures that the indexing thread never needs to wait for ZK operations needed for LIR. All tests pass except for HttpPartitionTest.testLeaderInitiatedRecoveryCRUD whose assumptions about the LIR workflow are no longer correct.

          Still running more tests.
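
          A minimal sketch of the "local replica map" guard described above, assuming the indexing thread only touches in-memory state; LirThreadGuard is an illustrative name, not the actual ZkController field.

              import java.util.concurrent.ConcurrentHashMap;
              import java.util.concurrent.ConcurrentMap;

              // Sketch only: at most one LIR thread per replica, with no ZK call on the indexing path.
              final class LirThreadGuard {
                private final ConcurrentMap<String, Boolean> replicasInLir = new ConcurrentHashMap<>();

                /** Returns true only for the first caller for a given replica, so at most one LIR thread starts. */
                boolean tryStartLir(String coreNodeName) {
                  return replicasInLir.putIfAbsent(coreNodeName, Boolean.TRUE) == null;
                }

                /** Called by the LIR thread when it finishes, successfully or not. */
                void lirFinished(String coreNodeName) {
                  replicasInLir.remove(coreNodeName);
                }
              }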

          Ramkumar Aiyengar added a comment -

          A couple of comments; looks sensible overall.

                log.info("Node " + replicaNodeName +
                        " is not live, so skipping leader-initiated recovery for replica: core={} coreNodeName={}",
                    replicaCoreName, replicaCoreNodeName);
                // publishDownState will be false to avoid publishing the "down" state too many times
                // as many errors can occur together and will each call into this method (SOLR-6189)
          

          It goes ahead and does `publishDownState` still if `forcePublishState` is true, is that intentional? The caller does check if the replica is live, but there could be a race. Similarly, if our state is suspect due to zk disconnect/session (the block before this), should the force be respected?

                // if the replica's state is not DOWN right now, make it so ...
                // we only really need to try to send the recovery command if the node itself is "live"
                if (getZkStateReader().getClusterState().liveNodesContain(replicaNodeName)) {
          
                  LeaderInitiatedRecoveryThread lirThread =
          

          The comment doesn't make sense as the code has moved to LIRT.

          Shalin Shekhar Mangar added a comment -

          Bulk move to 5.4 after 5.3 release.

          Shalin Shekhar Mangar added a comment -

          Patch updated to trunk.

          Thanks for the review Ramkumar Aiyengar and sorry for the delay in getting back to you.

          It goes ahead and does `publishDownState` still if `forcePublishState` is true, is that intentional?

          Yes, because if the replica somehow became 'active' when the LIR state is still 'down', we want to force publish its state again. The forcePublishState=true is only set in this one scenario.

          The caller does check if the replica is live, but there could be a race. Similarly, if our state is suspect due to zk disconnect/session (the block before this), should the force be respected?

          I think you're right. We should short-circuit the publishing part completely if the replica is not live or if our state is suspect.

          This patch incorporates both of your review comments.
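
          For clarity, the short-circuit discussed above could look roughly like this; a hedged sketch with hypothetical names, not the actual ZkController code.

              // Sketch only: a possible shape for the agreed short-circuit.
              final class LirPublishPolicy {
                private LirPublishPolicy() {}

                static boolean shouldPublishDownState(boolean replicaIsLive,
                                                      boolean zkStateSuspect,
                                                      boolean forcePublishState) {
                  if (!replicaIsLive || zkStateSuspect) {
                    return false; // never publish in these cases, even when the caller forces it
                  }
                  // Otherwise publish only when forced, e.g. the replica reports itself 'active'
                  // while its LIR state is still 'down'.
                  return forcePublishState;
                }
              }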

          Shalin Shekhar Mangar added a comment -

          My last patch had a merge error which was causing chaos monkey test failures. This patch fixes them.

          This still has a few nocommits: the ZkControllerTest leaks threads, which is due to LeaderInitiatedRecoveryThread taking on some of the responsibilities of ZkController. I think the LeaderInitiatedRecoveryThread has enough serious functionality that it should have its own test and not piggyback on ZkController. I'll attempt to refactor the class to make it more testable and write a unit test for it.

          Shalin Shekhar Mangar added a comment -
          1. Adds a new test: TestLeaderInitiatedRecoveryThread
          2. Removes ZkControllerTest.testEnsureReplicaInLeaderInitiatedRecovery which is no longer correct
          3. Removes portions of HttpPartitionTest.testLeaderInitiatedRecoveryCRUD which are no longer relevant to the new code
          4. Fixes a bug in LeaderInitiatedRecoveryThread which would send recovery messages even when a node was not live. This is tested in the new test.
          Shalin Shekhar Mangar added a comment -
          1. Removed some debug logging in ExecutorUtil that I accidentally left behind.
          2. Removed the nocommit in DistributedUpdateProcessor which had a comment to the effect of "not a StdNode, recovery command still gets sent once", which isn't true because we short-circuit errors on RetryNode at the beginning of the loop.

          I think this is ready.

          ASF subversion and git services added a comment -

          Commit 1702067 from shalin@apache.org in branch 'dev/trunk'
          [ https://svn.apache.org/r1702067 ]

          SOLR-7819: ZK connection loss or session timeout do not stall indexing threads anymore and LIR activity is moved to a background thread

          ASF subversion and git services added a comment -

          Commit 1702213 from shalin@apache.org in branch 'dev/branches/branch_5x'
          [ https://svn.apache.org/r1702213 ]

          SOLR-7819: ZK connection loss or session timeout do not stall indexing threads anymore and LIR activity is moved to a background thread

          Shalin Shekhar Mangar added a comment -

          Thanks Ramkumar for the review.


            People

            • Assignee:
              Shalin Shekhar Mangar
            • Reporter:
              Shalin Shekhar Mangar
            • Votes:
              0
            • Watchers:
              4
