Details
Type: Bug
Status: Closed
Priority: Major
Resolution: Cannot Reproduce
Description
Hello, I have a very simple setup: 2 shards and 2 replicas (4 nodes in total).
All I did was stop the shards. The first shard stopped immediately, but the second one took about 5 minutes to stop. The screenshot shows what happened next. In short:
1. Shard 1 stopped normally
2. Shard 2 was still performing some work but was not accepting connections
3. Replica 1 became a leader
4. Replica 2 did not become a leader because Shard 2 was still present but not working
5. The entire cluster was down until Shard 2 stopped and Replica 2 became a leader
Marked as critical because this shuts down the entire cluster. Please adjust if I am wrong.