Uploaded image for project: 'Apache Ozone'
  1. Apache Ozone
  2. HDDS-8699 Further Replication Manager Improvements
  3. HDDS-9321

LegacyReplicationManager: Save UNHEALTHY replicas with highest BCSID for a QUASI_CLOSED container

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Critical
    • Resolution: Fixed
    • None
    • 1.4.0
    • SCM

    Description

      Mix of quasi-closed and unhealthy replicas blocks decommission even if sufficiently replicated.
      a. Caused when only some of the replicas hit the error during write.
      b. Can be fixed by removing this check:

      if (!replicaSet.isHealthy()) {
                if (LOG.isDebugEnabled()) {
                  unhealthyIDs.add(cid);
                }
                if (unhealthy < CONTAINER_DETAILS_LOGGING_LIMIT
      

      However, simply removing that check is not a complete solution. We need to try and preserve any UNHEALTHY replicas that have the greatest Sequence ID. This jira focus on the Legacy Replication Manager part of the fix to preserve such UNHEALTHY replicas.

      Attachments

        Issue Links

          Activity

            People

              siddhant Siddhant Sangwan
              siddhant Siddhant Sangwan
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: