Uploaded image for project: 'Apache Ozone'
  1. Apache Ozone
  2. HDDS-6548 HDDS Stability Fixes and Improvements
  3. HDDS-6549

Do not mark container replica as unhealthy on write failure

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      If an operation fails on an open/ closing container on a DN, it is marked unhealthy so that subsequent write transactions also fail. The container itself is marked for closing by sending a request to SCM.

      Even if a write failed, other data in the container could be intact. Marking containers unhealthy should be a conservative operation. If let's say write fails on all replicas of a container, then all the replicas would be marked unhealthy. HDDS-6447 will ensure that replicas are not deleted if all the replicas are in unhealthy state. Irrespective of that, a container should not be marked as unhealthy when there is readable uncorrupted data in it.

      Currently, we cannot mark the container with any other state as all container state changes are initiated from the SCM. If we quasi close a container replica, instead of marking it unhealthy, when there is a write failure, then all the state transition logic in SCM would also have to be changed/updated.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              hanishakoneru Hanisha Koneru
              Votes:
              1 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated: