Uploaded image for project: 'Apache Ozone'
  1. Apache Ozone
  2. HDDS-9652

Legacy RM will not replicate all unhealthy containers when some are decommissioning

    XMLWordPrintableJSON

Details

    Description

      In a situation where a container is closed or quasi-closed with all unhealthy replicas, the legacy Replication Manager will not replicate the unhealthy containers if the under replication is caused by decommission.

      Ie, if we have 3 replicas:

      IN_SERVICE, unhealthy
      IN_SERVICE, unhealthy
      DECOMMISSIOING, unhealthy

      Then the legacy RM will take no action and decommission will be stuck.

      The new RM handles this state correctly and does not need to be changed.

      Additionally, in KeyValueContainer.java, unhealthy replicas cannot be exported / replicated, so even if the RM was not present, replication still could not happen.

      This change corrects both issues and adds a couple of extra tests to the new RM to prove it works ok.

      Attachments

        Issue Links

          Activity

            People

              sodonnell Stephen O'Donnell
              sodonnell Stephen O'Donnell
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: