Details
-
Sub-task
-
Status: Resolved
-
Critical
-
Resolution: Fixed
-
None
Description
Mix of quasi-closed and unhealthy replicas blocks decommission even if sufficiently replicated.
a. Caused when only some of the replicas hit the error during write.
b. Can be fixed by removing this check:
if (!replicaSet.isHealthy()) { if (LOG.isDebugEnabled()) { unhealthyIDs.add(cid); } if (unhealthy < CONTAINER_DETAILS_LOGGING_LIMIT
However, simply removing that check is not a complete solution. We need to try and preserve any UNHEALTHY replicas that have the greatest Sequence ID. https://issues.apache.org/jira/browse/HDDS-9321 takes care of the Legacy Replication Manager side of things to preserve such UNHEALTHY replicas. This jira focuses on the Decommissioning side.
Attachments
Issue Links
- requires
-
HDDS-9321 LegacyReplicationManager: Save UNHEALTHY replicas with highest BCSID for a QUASI_CLOSED container
- Resolved
- links to