Uploaded image for project: 'Apache Ozone'
  1. Apache Ozone
  2. HDDS-7759 Improve Ozone Replication Manager
  3. HDDS-7847

Handle Replication of Unhealthy Replicas in RM

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.4.0
    • SCM

    Description

      Note: Unhealthy refers to ContainerReplicaProto#State UNHEALTHY.

      Unhealthy ratis replicas are currently not considered when checking for replication in RatisReplicationCheckHandler. Since they are somehow corrupted, as long as we have healthy replicas, unhealthy replicas are not touched. We need new functionality to delete excess unhealthy replicas (with special considerations for QUASI_CLOSED containers) if we have sufficient replication otherwise. If only unhealthy replicas are remaining, we need to replicate them like healthy replicas.

      This jira proposes:
      1. Having a new handler - RatisUnhealthyReplicationCheckHandler - in the chain of handlers. This will check for replication of all replicas, including unhealthy. We should reach here only if there is perfect replication otherwise or only unhealthy replicas are remaining.

      2. Add a flag to RatisContainerReplicaCount that should be enabled to consider unhealthy replicas. This will help accomplish point 1.

      3. Some changes to RatisUnderReplicationHandler and RatisOverReplicationHandler.

      Attachments

        Issue Links

          Activity

            People

              siddhant Siddhant Sangwan
              siddhant Siddhant Sangwan
              Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: