Uploaded image for project: 'Apache Ozone'
  1. Apache Ozone
  2. HDDS-9280

Quasi-closed container with unhealthy replicas may remain under-replicated in 4 node cluster

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Duplicate
    • None
    • None
    • SCM
    • None

    Description

      This is a follow-up based on discussion around HDDS-8536 in this PR. Replication manager will try to keep all replicas with unique origin node IDs for quasi-closed containers that cannot be closed, while fully replicating the healthy replicas. This could require up to 5 replicas to satisfy, for example:

      SCM state: Quasi-closed
      Replica states: quasi-closed(origin 1) quasi-closed(origin 1) quasi-closed(origin 1) unhealthy(origin 2) unhealthy(origin 3)

      In a 4 node cluster, or a 5 node cluster trying to decommission a node, this would remain under-replicated or get stuck. In this case, we need to choose how to proceed.

      Attachments

        1. QUASI_CLOSED Containers in a Small Cluster.pdf
          77 kB
          Siddhant Sangwan
        2. Quasi_dilemma.png
          32 kB
          Siddhant Sangwan

        Issue Links

          Activity

            People

              siddhant Siddhant Sangwan
              erose Ethan Rose
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: