Description
I recently looked into some problem related to racing between LIR and Recovering. I would like to propose a totally new approach to solve SOLR-5495 problem because fixing current implementation by a bandage will lead us to other problems (we can not prove the correctness of the implementation).
Feel free to give comments/thoughts about this new scheme.
https://docs.google.com/document/d/1dM2GKMULsS45ZMuvtztVnM2m3fdUeRYNCyJorIIisEo/edit?usp=sharing
Attachments
Attachments
Issue Links
- is duplicated by
-
SOLR-9945 LIR should check the node is recovering before bring it down
-
- Resolved
-
- relates to
-
SOLR-10398 Multiple LIR requests can fail PeerSync even if it succeeds
-
- Resolved
-
-
SOLR-9555 Leader incorrectly publishes state for replica when it puts replica into LIR.
-
- Closed
-
- supercedes
-
SOLR-6236 Need an optional fallback mechanism for selecting a leader when all replicas are in leader-initiated recovery.
-
- Resolved
-