SOLR-8085: Fix a variety of issues that can result in replicas getting out of sync.

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 5.4, 6.0
    • Component/s: None
    • Labels: None

      Description

      I've been discussing this fail I found with Yonik.

      The problem seems to be that a replica tries to recover and publishes itself as recovering - the attempt then fails, but docs are now coming in from the leader. The replica tries to recover again and has by then received enough docs to pass peer sync.

      I'm trying a possible solution now where we won't allow peer sync after a recovery that is not successful.
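
      As a minimal sketch of that idea only (names and structure are illustrative, not the actual patch attached here): remember whether the previous recovery attempt for a core succeeded, and skip the peer sync shortcut when it did not.

        // Illustrative sketch only, not the committed SOLR-8085 change.
        class PerCoreRecoveryState {
            private volatile boolean lastRecoverySucceeded = true;

            void recoveryFinished(boolean success) {
                lastRecoverySucceeded = success;
            }

            boolean allowPeerSync() {
                // After a failed attempt, docs buffered from the leader can make the replica
                // look "close enough" to pass peer sync even though older updates are missing,
                // so force full replication instead.
                return lastRecoverySucceeded;
            }
        }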

      Attachments

      1. fail.150922_125320
        14.79 MB
        Yonik Seeley
      2. fail.150922_130608
        8.01 MB
        Yonik Seeley
      3. SOLR-8085.patch
        6 kB
        Mark Miller
      4. SOLR-8085.patch
        3 kB
        Mark Miller
      5. SOLR-8085.patch
        2 kB
        Yonik Seeley

          Activity

          Yonik Seeley added a comment -

          Here's a fail from HdfsChaosMonkeySafeLeaderTest on trunk (with some minor patches provided by Mark).

          Throwable #1: java.lang.AssertionError: shard3 is not consistent. Got 658 from http://127.0.0.1:44649/collection1lastClient and got 330 from http://127.0.0.1:38249/collection1

          Yonik Seeley added a comment -

          Another one that has a smaller log file.

          Throwable #1: java.lang.AssertionError: shard3 is not consistent. Got 375 from http://127.0.0.1:40940/collection1lastClient and got 230 from http://127.0.0.1:47239/collection1

          Mark Miller added a comment -

          I'm trying a possible solution now where we won't allow peer sync after a recovery that is not successful.

          This seems to work out well.

          Most of these issues show up more easily when running the heavier HDFS tests, so I am running those. Because truncate support in HDFS was not added until 2.7, when a replication fails we still replay all the buffered docs. A badly timed server restart can then pass peer sync incorrectly if this happens, regardless of my attempted fix, which does not carry across Jetty restarts.

          Yonik Seeley added a comment -

          OK, here's an analysis of fail.150922_130608.

          It looks like LIR happens during normal recovery after startup, and we finally end up doing recovery with recoveringAfterStartup=false, which uses recent versions in peer sync (and includes docs that were buffered while we were recovering) rather than the true startup versions. This causes peer sync to pass when it should not have. (A small sketch of the flag's effect follows the annotated log below.)

          
          (add first appears, node 47239 appears to be coming up at the time)
            2> 74317 INFO  (qtp324612161-378) [n:127.0.0.1:40940_ c:collection1 s:shard3 r:core_node6 x:collection1] o.a.s.u.p.LogUpdateProcessor [collection1] webapp= path=/update params={update.distrib=FROMLEADER&distrib.from=http://127.0.0.1:38911/collection1/&wt=javabin&version=2} {add=[0-333 (1513033816377131008)]} 0 19
            2> 74317 INFO  (qtp324612161-378) [n:127.0.0.1:40940_ c:collection1 s:shard3 r:core_node6 x:collection1] o.a.s.u.p.LogUpdateProcessor [collection1] webapp= path=/update params={update.distrib=FROMLEADER&distrib.from=http://127.0.0.1:38911/collection1/&wt=javabin&version=2} {add=[0-333 (1513033816377131008)]} 0 19
            2> 74317 INFO  (qtp661063741-234) [n:127.0.0.1:38911_ c:collection1 s:shard3 r:core_node2 x:collection1] o.a.s.u.p.LogUpdateProcessor [collection1] webapp= path=/update params={wt=javabin&version=2} {add=[0-333 (1513033816377131008)]} 0 31
            
          (node coming up)
            2> 75064 INFO  (coreLoadExecutor-196-thread-1-processing-n:127.0.0.1:47239_) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.VersionInfo Refreshing highest value of _version_ for 256 version buckets from index
            2> 75065 INFO  (coreLoadExecutor-196-thread-1-processing-n:127.0.0.1:47239_) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.UpdateLog Took 31.0ms to seed version buckets with highest version 1513033812159758336
            2> 75120 INFO  (coreZkRegister-190-thread-1-processing-n:127.0.0.1:47239_ x:collection1 s:shard3 c:collection1 r:core_node10) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.c.ZkController Replaying tlog for http://127.0.0.1:47239/collection1/ during startup... NOTE: This can take a while.
            2> 75162 DEBUG (recoveryExecutor-199-thread-1-processing-n:127.0.0.1:47239_ x:collection1 s:shard3 c:collection1 r:core_node10) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.UpdateLog add add{flags=a,_version_=1513033807303802880,id=1-3}
            2> 75162 DEBUG (recoveryExecutor-199-thread-1-processing-n:127.0.0.1:47239_ x:collection1 s:shard3 c:collection1 r:core_node10) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.p.LogUpdateProcessor PRE_UPDATE add{flags=a,_version_=1513033807303802880,id=1-3} LocalSolrQueryRequest{update.distrib=FROMLEADER&log_replay=true}
          
          (replay finished)
            2> 75280 DEBUG (recoveryExecutor-199-thread-1-processing-n:127.0.0.1:47239_ x:collection1 s:shard3 c:collection1 r:core_node10) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.p.LogUpdateProcessor PRE_UPDATE add{flags=a,_version_=1513033812159758336,id=1-132} LocalSolrQueryRequest{update.distrib=FROMLEADER&log_replay=true}
           
          (meanwhile, the leader is asking us to recover?)
            2> 75458 WARN  (updateExecutor-14-thread-5-processing-x:collection1 r:core_node2 http:////127.0.0.1:47239//collection1// n:127.0.0.1:38911_ s:shard3 c:collection1) [n:127.0.0.1:38911_ c:collection1 s:shard3 r:core_node2 x:collection1] o.a.s.c.LeaderInitiatedRecoveryThread Asking core=collection1 coreNodeName=core_node10 on http://127.0.0.1:47239 to recover; unsuccessful after 2 of 120 attempts so far ...
            
          (and we see the request to recover)
            2> 75475 INFO  (qtp2087242119-1282) [n:127.0.0.1:47239_    ] o.a.s.h.a.CoreAdminHandler It has been requested that we recover: core=collection1
          
          (so we cancel the existing recovery)
            2> 75478 INFO  (Thread-1246) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.DefaultSolrCoreState Running recovery - first canceling any ongoing recovery
          
            2> 75552 INFO  (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.c.RecoveryStrategy Starting recovery process. recoveringAfterStartup=true
          
            2> 75610 INFO  (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.c.RecoveryStrategy ###### startupVersions=[1513033812159758336, [...]
            2> 75611 INFO  (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.c.RecoveryStrategy Publishing state of core collection1 as recovering, leader is http://127.0.0.1:38911/collection1/ and I am http://127.0.0.1:47239/collection1/
            2> 75611 INFO  (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.c.ZkController publishing state=recovering
           
            2> 75642 INFO  (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.c.RecoveryStrategy Sending prep recovery command to http://127.0.0.1:38911; WaitForState: action=PREPRECOVERY&core=collection1&nodeName=127.0.0.1%3A47239_&coreNodeName=core_node10&state=recovering&checkLive=true&onlyIfLeader=true&onlyIfLeaderActive=true
          
          (In the meantime, we still receive updates.  1-414 is one of the updates that we are missing!!!  I assume we are buffering these?)
            2> 75782 DEBUG (qtp2087242119-1281) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.p.LogUpdateProcessor PRE_UPDATE add{,id=1-414} {update.distrib=FROMLEADER&distrib.from=http://127.0.0.1:38911/collection1/&wt=javabin&version=2}
            2> 75828 INFO  (qtp2087242119-1281) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.p.LogUpdateProcessor [collection1] webapp= path=/update params={update.distrib=FROMLEADER&distrib.from=http://127.0.0.1:38911/collection1/&wt=javabin&version=2} {add=[1-414 (1513033817943703552)]} 0 45
          
          (what's this about?)
            2> 75923 INFO  (recoveryExecutor-199-thread-1-processing-n:127.0.0.1:47239_ x:collection1 s:shard3 c:collection1 r:core_node10) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.s.SolrIndexSearcher Opening Searcher@2ba72586[collection1] main
          
          (wait... replay? is this from when we started up and we never stopped it?  should we have?)
            2> 76135 DEBUG (recoveryExecutor-199-thread-1-processing-n:127.0.0.1:47239_ x:collection1 s:shard3 c:collection1 r:core_node10) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.p.LogUpdateProcessor PRE_UPDATE FINISH LocalSolrQueryRequest{update.distrib=FROMLEADER&log_replay=true}
          
          
          (log replay finishes... the dbq *:* shows that this is the tlog from the beginning of this test.. i.e. we never got to do a commit at first)
            2> 76181 INFO  (recoveryExecutor-199-thread-1-processing-n:127.0.0.1:47239_ x:collection1 s:shard3 c:collection1 r:core_node10) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.p.LogUpdateProcessor [collection1] {deleteByQuery=*:* (-1513033805984694272),add=[1-3 (1513033807303802880), 0-5 (1513033807434874880), 1-6 (1513033807628861440), 1-7 (1513033807699116032), 1-9 (1513033807797682176), 0-14 (1513033807941337088), 0-15 (1513033807976988672), 1-14 (1513033807991668736), 1-15 (1513033808035708928), 0-17 (1513033808075554816), ... (61 adds)]} 0 1060
            2> 76182 WARN  (recoveryExecutor-199-thread-1-processing-n:127.0.0.1:47239_ x:collection1 s:shard3 c:collection1 r:core_node10) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.UpdateLog Log replay finished. recoveryInfo=RecoveryInfo{adds=61 deletes=0 deleteByQuery=1 errors=0 positionOfStart=0}
          
          (we're canceling recovery again?)
            2> 76182 INFO  (coreZkRegister-190-thread-1-processing-n:127.0.0.1:47239_ x:collection1 s:shard3 c:collection1 r:core_node10) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.DefaultSolrCoreState Running recovery - first canceling any ongoing recovery
            2> 76186 WARN  (coreZkRegister-190-thread-1-processing-n:127.0.0.1:47239_ x:collection1 s:shard3 c:collection1 r:core_node10) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.c.RecoveryStrategy Stopping recovery for core=collection1 coreNodeName=core_node10
            
          (did this fail because we canceled the recovery?)
            2> 76194 ERROR (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.c.RecoveryStrategy Error while trying to recover.:java.util.concurrent.ExecutionException: org.apache.solr.client.solrj.SolrServerException: IOException occured when talking to server at: http://127.0.0.1:38911
            2> 	at java.util.concurrent.FutureTask.report(FutureTask.java:122)
            2> 	at java.util.concurrent.FutureTask.get(FutureTask.java:192)
            2> 	at org.apache.solr.cloud.RecoveryStrategy.sendPrepRecoveryCmd(RecoveryStrategy.java:598)
            2> 	at org.apache.solr.cloud.RecoveryStrategy.doRecovery(RecoveryStrategy.java:361)
            2> 	at org.apache.solr.cloud.RecoveryStrategy.run(RecoveryStrategy.java:227)
            2> Caused by: org.apache.solr.client.solrj.SolrServerException: IOException occured when talking to server at: http://127.0.0.1:38911
            2> 	at org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:590)
            2> 	at org.apache.solr.client.solrj.impl.HttpSolrClient$1.call(HttpSolrClient.java:285)
            2> 	at org.apache.solr.client.solrj.impl.HttpSolrClient$1.call(HttpSolrClient.java:281)
            2> 	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
            2> 	at org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor$1.run(ExecutorUtil.java:231)
            2> 	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
            2> 	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
            2> 	at java.lang.Thread.run(Thread.java:745)
            2> Caused by: java.net.SocketException: Socket closed
            2> 76202 INFO  (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.c.RecoveryStrategy Finished recovery process.
          
          (note: this is the last we hear of this thread... not clear if that's OK or not)
            2> 76202 INFO  (coreZkRegister-190-thread-1-processing-n:127.0.0.1:47239_ x:collection1 s:shard3 c:collection1 r:core_node10) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.c.ActionThrottle The last recovery attempt started 711ms ago.
            2> 76202 INFO  (coreZkRegister-190-thread-1-processing-n:127.0.0.1:47239_ x:collection1 s:shard3 c:collection1 r:core_node10) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.c.ActionThrottle Throttling recovery attempts - waiting for 9288ms
            
           (still receiving updates, while we are waiting to do the next recovery attempt...  are these still buffering? how do we tell?)
            2> 76252 DEBUG (qtp2087242119-1283) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.p.LogUpdateProcessor PRE_UPDATE add{,id=0-432} {update.distrib=FROMLEADER&distrib.from=http://127.0.0.1:38911/collection1/&wt=javabin&version=2}
          
          
          (OK, looks like we are finally trying the recovery process again... but note that "recoveringAfterStartup" is now false!!!!)
            2> 85491 INFO  (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.c.RecoveryStrategy Starting recovery process. recoveringAfterStartup=false
            2> 85531 INFO  (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.c.RecoveryStrategy Publishing state of core collection1 as recovering, leader is http://127.0.0.1:38911/collection1/ and I am http://127.0.0.1:47239/collection1/
          
          (Peersync.  Because recoveringAfterStartup is false, it will use most recent versions to compare, rather than startup versions)
          (As an example, version 1513033825996767232 is very recent, at timestamp 83465, while we were still not recovered)
            2> 92554 INFO  (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.c.RecoveryStrategy Attempting to PeerSync from http://127.0.0.1:38911/collection1/ - recoveringAfterStartup=false
            2> 92554 DEBUG (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.PeerSync PeerSync: core=collection1 url=http://127.0.0.1:47239 startingVersions=100 [1513033825996767232, [...]
            2> 92646 INFO  (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.PeerSync PeerSync: core=collection1 url=http://127.0.0.1:47239  Received 100 versions from http://127.0.0.1:38911/collection1/
            2> 92647 DEBUG (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.PeerSync PeerSync: core=collection1 url=http://127.0.0.1:47239  sorted versions from http://127.0.0.1:38911/collection1/ = [1513033825996767232, ...
            2> 92647 INFO  (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.PeerSync PeerSync: core=collection1 url=http://127.0.0.1:47239  Our versions are newer. ourLowThreshold=1513033822028955648 otherHigh=1513033825172586496
            2> 92647 INFO  (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.u.PeerSync PeerSync: core=collection1 url=http://127.0.0.1:47239 DONE. sync succeeded
          
          (And we pass peersync when we should not)
            2> 92796 INFO  (RecoveryThread-collection1) [n:127.0.0.1:47239_ c:collection1 s:shard3 r:core_node10 x:collection1] o.a.s.c.RecoveryStrategy PeerSync Recovery was successful - registering as Active.
          
            2> ######shard3 is not consistent.  Got 375 from http://127.0.0.1:40940/collection1lastClient and got 230 from http://127.0.0.1:47239/collection1
            2> ###### sizes=375,230
            2> ###### Only in http://127.0.0.1:40940/collection1: [{_version_=1513033815669342208, id=0-295}, [...], {_version_=1513033816377131008, id=0-333}]
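
          To make the flag's effect concrete, here is a rough sketch (not the real RecoveryStrategy/PeerSync code) of what recoveringAfterStartup selects for comparison. With the flag false, the most recent local versions include updates that were only buffered while the replica was still recovering, which is what lets the sync pass here.

            import java.util.List;

            // Illustrative only: the shape of the bug described above.
            class VersionChoiceSketch {
                static List<Long> versionsToCompare(boolean recoveringAfterStartup,
                                                    List<Long> startupVersions,
                                                    List<Long> recentVersions) {
                    // true  -> compare the versions the replica had at startup
                    // false -> compare its most recent versions, buffered updates included
                    return recoveringAfterStartup ? startupVersions : recentVersions;
                }
            }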
           
          
          Yonik Seeley added a comment -

          OK, here's one possible patch I think.

          Mark Miller added a comment -

          but note that "recoveringAfterStartup" is now false!!!!

          Yeah, lots of ways to lose the class and start over - so if you really want a field to persist, it has to be static.

          Mark Miller added a comment -

          Looked at the patch - yeah, or put it on the core state

          Yonik Seeley added a comment -

          Yeah, it can't be static because each core needs its own state.
          We could also maintain it as a normal variable in RecoveryStrategy and either reuse RecoveryStrategy objects, or initialize future objects from past objects. Thoughts on best approach?
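
          As a purely illustrative sketch of the per-core option under discussion (not code from either patch), the flag could live in state keyed by core, or on the shared per-core state object, rather than in a static field:

            import java.util.Map;
            import java.util.concurrent.ConcurrentHashMap;

            // Illustrative only: keep the "last recovery succeeded" flag per core so it
            // survives RecoveryStrategy instances being thrown away, without sharing a
            // single static boolean across every core.
            class RecoverySuccessTracker {
                private final Map<String, Boolean> lastRecoverySucceeded = new ConcurrentHashMap<>();

                void recordResult(String coreName, boolean success) {
                    lastRecoverySucceeded.put(coreName, success);
                }

                boolean mayPeerSync(String coreName) {
                    // a core that has never attempted recovery defaults to allowing peer sync
                    return lastRecoverySucceeded.getOrDefault(coreName, Boolean.TRUE);
                }
            }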

          Mark Miller added a comment -

          Could be a static map or something in RecoveryStrategy too - but seeing as we already store another variable like this in the default core state, this made a lot of sense to me.

          With your patch and running on a patched version of 4.10.3, I was still only seeing one other type of fail.

          Docs that came in during recovery - after publishing as recovering but before buffering started - would end up causing a false peer sync pass if enough of them came in.

          I seem to have worked around this issue by buffering docs before peer sync and before publishing as RECOVERING (the signal for the leader to start sending updates).
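
          As a purely illustrative outline of that ordering (not the committed RecoveryStrategy change), the essential point is that buffering starts before the RECOVERING publish, so every update the leader forwards from that moment on lands in the buffer:

            // Illustrative pseudo-structure only.
            interface RecoveryOps {
                void bufferUpdates();
                void publishState(String state);
                boolean tryPeerSync();
                void replicateFromLeader();
                void applyBufferedUpdates();
            }

            class BufferFirstRecovery {
                void recover(RecoveryOps ops) {
                    ops.bufferUpdates();            // 1. buffer before anything else
                    ops.publishState("recovering"); // 2. leader starts forwarding updates; they are buffered
                    if (!ops.tryPeerSync()) {       // 3. cheap sync attempt
                        ops.replicateFromLeader();  // 4. fall back to full index replication
                    }
                    ops.applyBufferedUpdates();     // 5. replay what was buffered above
                    ops.publishState("active");
                }
            }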

          With my current runs using no deletes, I have not yet found a fail after this on this version of the code.

          Mark Miller added a comment -

          I should mention one other change, since I am mostly testing the HDFS version of this ChaosMonkey test: after talking to Yonik, I also fixed an issue where, because we don't have truncate support, we were replaying buffered docs on a failure just to get past them. We really should not do that, as it can lead to bad peer sync passes, and I have a fix for that as well. I'll file a separate JIRA issue for that one.

          Mark Miller added a comment -

          For the last comment I filed SOLR-8094.

          Mark Miller added a comment -

          Here is a patch I just ported to trunk that starts buffering before peer sync and before we publish as RECOVERING.

          With Yonik Seeley's fix and the couple of fixes I have, I've been able to hit 200 HdfsChaosMonkey test runs without replica inconsistency on my 4.10.3 + backports based code for the first time.

          Mark Miller added a comment -

          Here is a patch I just ported to trunk that starts buffering before peer sync and before we publish as RECOVERING.

          It looks like this may pose a problem for shard splitting.

          Mark Miller added a comment -

          Never mind - the patch was missing a change. Here is a new patch that combines with Yonik's.

          ASF subversion and git services added a comment -

          Commit 1706423 from Mark Miller in branch 'dev/trunk'
          [ https://svn.apache.org/r1706423 ]

          SOLR-8085: Fix a variety of issues that can result in replicas getting out of sync.

          ASF subversion and git services added a comment -

          Commit 1706424 from Mark Miller in branch 'dev/branches/branch_5x'
          [ https://svn.apache.org/r1706424 ]

          SOLR-8085: Fix a variety of issues that can result in replicas getting out of sync.

          Mark Miller added a comment -

          I also have a test change we may need to prevent false fails - I'll spin that off into its own issue.

          Joel Bernstein added a comment -

          Mark Miller, I'm getting errors in the TestSQLHandler test cases (the test extends AbstractFullDistribZkTestBase) following this commit. In the test case it looks like one of the shards is not having documents added to it. Still investigating...

          Joel Bernstein added a comment -

          I think I'm using the wrong method to index documents and this ticket just exposed that.

          Joel Bernstein added a comment -

          Ok, the issue was that I wasn't calling: waitForRecoveriesToFinish(false);

          Once I added this, the test passes.
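
          For reference, a hedged example of that fix: everything except the waitForRecoveriesToFinish(false) call is illustrative test scaffolding, not taken from TestSQLHandler.

            import org.apache.solr.cloud.AbstractFullDistribZkTestBase;
            import org.junit.Test;

            // Illustrative test sketch: wait for all replicas to finish recovering before
            // indexing, otherwise updates sent to a still-recovering replica may be buffered
            // and a shard can look like it is missing documents.
            public class ExampleCloudTest extends AbstractFullDistribZkTestBase {
                @Test
                public void testAfterRecoveries() throws Exception {
                    waitForRecoveriesToFinish(false); // the call Joel added
                    // ... index documents and assert on per-shard results ...
                }
            }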

          Shalin Shekhar Mangar added a comment -

          Good catch guys! Thanks for fixing this. Special thanks to Yonik for analysing the failure logs.

          Mark Miller added a comment -

          I also have a test change we may need to prevent false fails - I'll spin that off into its own issue.

          SOLR-8121


            People

            • Assignee: Mark Miller
            • Reporter: Mark Miller
            • Votes: 0
            • Watchers: 7
