  Apache Ozone / HDDS-3214

Unhealthy datanodes repeatedly participate in pipeline creation

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Blocker
    • Resolution: Fixed
    • None
    • None
    • Component/s: SCM

    Description

      Steps taken:

      1) Mounted a noise-injection FUSE filesystem on all datanodes.

      2) Selected 1 datanode from each open pipeline (factor=3).

      3) Injected WRITE FAILURE noise with error code ENOENT on the "hdds.datanode.dir" path of each datanode selected in step 2.

      4) Started a PUT key operation of size 32 MB.

       

      Observations:

      ----------------

      1. The commit failed, and the affected pipelines were moved to the exclusion list.
      2. The client retried, and a new pipeline was created with the same set of datanodes. Container creation failed because the WRITE FAILURE injection was still present.
      3. The pipeline was closed, and the process repeated for "ozone.client.max.retries" retries.
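
      The loop described in these observations can be modelled with a small sketch. It is not Ozone code: RetryLoopSketch, Pipeline, createPipeline and putKey are hypothetical names, and the placement rule (always take the first three live nodes) is an assumption chosen only to reproduce the symptom. It illustrates why excluding pipeline IDs alone does not help when every new pipeline is built from the same datanodes.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical model of the observed retry loop; none of these names are Ozone APIs.
final class RetryLoopSketch {

  /** In this model a pipeline is just an id plus its member datanodes. */
  static final class Pipeline {
    final int id;
    final List<String> nodes;

    Pipeline(int id, List<String> nodes) {
      this.id = id;
      this.nodes = nodes;
    }
  }

  private int nextPipelineId = 0;

  /** Assumed naive placement: always takes the first three live nodes. */
  Pipeline createPipeline(List<String> liveNodes) {
    return new Pipeline(nextPipelineId++, new ArrayList<>(liveNodes.subList(0, 3)));
  }

  /** Returns every pipeline tried; a write fails if any member is in failingNodes. */
  List<Pipeline> putKey(List<String> liveNodes, Set<String> failingNodes, int maxRetries) {
    Set<Integer> excludedPipelines = new HashSet<>();
    List<Pipeline> tried = new ArrayList<>();
    for (int attempt = 0; attempt < maxRetries; attempt++) {
      Pipeline p = createPipeline(liveNodes);
      tried.add(p);
      boolean writeOk = p.nodes.stream().noneMatch(failingNodes::contains);
      if (writeOk) {
        break;
      }
      // Only the pipeline id is remembered, so the next pipeline
      // (with a fresh id) is free to reuse the same failing datanode.
      excludedPipelines.add(p.id);
    }
    return tried;
  }
}
```

      With five live nodes and one of the first three failing writes, every attempt in this model ends up on the same three datanodes until the retry budget is exhausted.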

      Every time, the same set of datanodes was selected for pipeline creation, including 1 unhealthy datanode.

      Expectation: the pipeline should have been created from 3 of the available healthy datanodes.
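
      A minimal sketch of the expected behaviour, assuming the placement step is given a set of datanodes to avoid. HealthyPlacementSketch and chooseDatanodes are hypothetical names, not Ozone's placement policy API, and this does not describe how the issue was actually fixed. It only shows the selection rule the expectation implies: skip excluded datanodes and fail clearly if fewer than "factor" eligible nodes remain.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.List;
import java.util.Set;

// Hypothetical helper, not Ozone's placement policy API.
final class HealthyPlacementSketch {

  /** Picks `factor` datanodes from liveNodes, skipping every node in `excluded`. */
  static List<String> chooseDatanodes(Collection<String> liveNodes,
                                      Set<String> excluded,
                                      int factor) {
    if (factor <= 0) {
      throw new IllegalArgumentException("factor must be positive");
    }
    List<String> chosen = new ArrayList<>();
    for (String node : liveNodes) {
      if (excluded.contains(node)) {
        continue; // node previously failed a write, so do not reuse it
      }
      chosen.add(node);
      if (chosen.size() == factor) {
        return chosen;
      }
    }
    throw new IllegalStateException(
        "only " + chosen.size() + " eligible datanodes, need " + factor);
  }
}
```

      In this sketch the caller is responsible for adding a datanode to the excluded set after a failed write, so a retry with factor=3 lands on three other nodes when enough are available.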

       

      cc - ljain

       

            People

              erose Ethan Rose
              nilotpalnandi Nilotpal Nandi