Uploaded image for project: 'Apache Ozone'
  1. Apache Ozone
  2. HDDS-5771

Speed up TestDatanodeHddsVolumeFailureToleration by reducing dead interval

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.2.0
    • 1.2.0
    • test

    Description

      In TestDatanodeHddsVolumeFailureToleration, the tests wait for a long time on a DN to be marked dead by SCM. This is because the stale, dead and heartbeat intervals have not been set low enough, as they are using the default. Changing the various timers, the runtime is reduced from about 4mins to 53 seconds on my laptop.

      Further, there are two tests - one that fails one volume on a DN and ensures it can restart ok. Then a test which fails 2 volumes and ensure it fails to start. We can combine these two tests into 1 and save creating a second mini-Cluster. This further reduces the runtime to 37s.

      On github, this test runs in:

      [INFO] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 236.534 s - in org.apache.hadoop.ozone.dn.volume.TestDatanodeHddsVolumeFailureToleration
      

      Attachments

        Issue Links

          Activity

            People

              sodonnell Stephen O'Donnell
              sodonnell Stephen O'Donnell
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: