Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
None
Description
The legacy replication manager currently resolves mismatched replicas (those whose replica state do not match SCM's container state) by
1. Replicating the matching replicas until they are fully replicated.
2. Deleting the mismatched replicas.
This approach does not work when LRM is presented with the following situation:
SCM state: CLOSED
5 datanodes in the cluster
Replica states CLOSED CLOSED QUASI QUASI QUASI
LRM will not make progress because there is no datanode to add a closed replica to that does not already have a replica.
Attachments
Issue Links
- is related to
-
HDDS-8536 ReplicationManager: Unhealthy replicas could block Ratis containers being recovered
- Resolved
- links to