Uploaded image for project: 'Apache Ozone'
  1. Apache Ozone
  2. HDDS-6496

Improve CI checks

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: In Progress
    • Major
    • Resolution: Unresolved
    • None
    • None
    • build, CI
    • None

    Description

      Some ideas for improvement:

      • compile (11) is not necessary for acceptance test changes, when compilation is necessary only to get the binary tarball for running the tests.
      • basic checks are deterministic (except basic (unit) to some degree). Failures can only be fixed by code change, so we can restore fail-fast behavior.
      • basic checks are also quick, so we can postpone starting integration until these finish.

      Attachments

        Issue Links

          1.
          Skip compile(11) for non-java changes Sub-task Resolved Attila Doroszlai  
          2.
          Reduce OzoneFS test combinations in ozonesecure Sub-task Resolved Attila Doroszlai  
          3.
          Create compat acceptance split Sub-task Resolved Attila Doroszlai  
          4.
          Avoid repeating some S3 tests for different bucket types Sub-task Resolved Attila Doroszlai  
          5.
          Further splits integration tests Sub-task Resolved Attila Doroszlai  
          6.
          Verify build from source release tarball in CI Sub-task Resolved Attila Doroszlai  
          7.
          Omit download progress in builds Sub-task Resolved Attila Doroszlai  
          8.
          Add CI to check pull request title Sub-task Resolved Kaijie Chen  
          9.
          Increase timeout for basic check Sub-task Resolved Attila Doroszlai  
          10.
          Disallow ellipsis in PR title Sub-task Resolved Attila Doroszlai  
          11.
          Ignore pr_title_check for selective checks Sub-task Resolved Attila Doroszlai  
          12.
          Skip installing awscli if already present Sub-task Resolved Attila Doroszlai  
          13.
          Timed out tests missing from bundle Sub-task Resolved Attila Doroszlai  
          14.
          Separate scheduled CI from push/PR workflows Sub-task Resolved Attila Doroszlai  
          15.
          Suppress warning about long filenames in tar Sub-task Resolved Attila Doroszlai  
          16.
          Kubernetes check should fail fast if cluster cannot start Sub-task Resolved Attila Doroszlai  
          17.
          Acceptance tests need rebalancing Sub-task Resolved Attila Doroszlai  
          18.
          Reorder OM nodes in HA acceptance tests Sub-task Resolved Attila Doroszlai  
          19.
          Reduce S3 acceptance test setup time Sub-task Resolved Attila Doroszlai  
          20.
          Run HA secure tests from s3g container Sub-task Resolved Attila Doroszlai  
          21.
          Disable JaCoCo for PRs and in forks Sub-task Resolved Attila Doroszlai  
          22.
          Selective checks: handle change in junit.sh Sub-task Resolved Attila Doroszlai  
          23.
          Selective checks: handle change in ci.yaml Sub-task Resolved Attila Doroszlai  
          24.
          Increase timeout of CI build Sub-task Resolved Attila Doroszlai  
          25.
          Fail integration tests if RocksDB objects are leaked Sub-task Resolved Attila Doroszlai  
          26.
          Support multiple acceptance test scripts with custom config Sub-task Resolved Attila Doroszlai  
          27.
          Update compose README Sub-task Resolved Attila Doroszlai  
          28.
          Let run.sh start more than 3 datanodes Sub-task Resolved Attila Doroszlai  
          29.
          Selective checks: skip acceptance tests for compose doc changes Sub-task Resolved Attila Doroszlai  
          30.
          Test Hadoop compatibility in Ozone HA environment Sub-task Resolved Attila Doroszlai  
          31.
          Convert s3-haproxy to an add-on Sub-task Resolved Attila Doroszlai  
          32.
          Reduce ozone sh calls in robot tests Sub-task Resolved Attila Doroszlai  
          33.
          Reuse same Ozone cluster for Hadoop tests Sub-task Resolved Attila Doroszlai  
          34.
          Create separate check for native build Sub-task Resolved Attila Doroszlai  
          35.
          Ensure docker env is stopped Sub-task Resolved Attila Doroszlai  
          36.
          error validating kustomization.yaml Sub-task Resolved Attila Doroszlai  
          37.
          Keep separate Robot XMLs if rebot fails to combine them Sub-task Resolved Attila Doroszlai  
          38.
          Rebalance integration tests Sub-task Resolved Attila Doroszlai  
          39.
          Update instructions in dependency check Sub-task Resolved Attila Doroszlai  
          40.
          Add dedicated log file for each service in docker test Sub-task Resolved Attila Doroszlai  
          41.
          Error creating stack dump in S3 HA environment Sub-task Resolved Attila Doroszlai  
          42.
          Skip acceptance/kubernetes for license changes Sub-task Resolved Attila Doroszlai  
          43.
          Extract nodes required for SCM decommission test Sub-task Resolved Attila Doroszlai  
          44.
          Create separate acceptance split for cert rotation Sub-task Resolved Attila Doroszlai  
          45.
          Create a script to list all acceptance test splits Sub-task Resolved Attila Doroszlai  
          46.
          Merge unit and integration checks into a combined one Sub-task Resolved Attila Doroszlai  
          47.
          Require successful basic checks for long-running tests Sub-task Resolved Devesh Kumar Singh  
          48.
          Summarize failures of acceptance/kubernetes checks Sub-task Resolved Attila Doroszlai  
          49.
          Mark disabled tests using custom tag/category Sub-task Resolved Devesh Kumar Singh  
          50.
          Intermittent error while removing docker network Sub-task Resolved Szabolcs Gál  
          51.
          Handle empty unit check Sub-task Resolved Attila Doroszlai  
          52.
          Fail checks in Summary step instead of Test Sub-task Resolved Attila Doroszlai  
          53.
          Introduce a new workflow for testing intermittently failing unit tests. Sub-task Resolved Sadanand Shenoy  
          54.
          Parameter for fail-fast behavior in flaky-test-check Sub-task Resolved Attila Doroszlai  
          55.
          Upload artifacts of unit/native check Sub-task Resolved Attila Doroszlai  
          56.
          Allow disabling JUnit timeouts Sub-task Resolved Attila Doroszlai  
          57.
          Increase timeout of unit check Sub-task Resolved Attila Doroszlai  
          58.
          Group output of check steps Sub-task Resolved Himanshi Darvekar  
          59.
          Improve flaky-test-check job name. Sub-task Resolved Sadanand Shenoy  
          60.
          Remove scheduled CI run Sub-task Resolved Attila Doroszlai  
          61.
          Skip unhealthy tests in repeated runs workflow Sub-task Resolved Attila Doroszlai  
          62.
          Redirect pre-test build output for integration check Sub-task Resolved Attila Doroszlai  
          63.
          Introduce FlakyTest category for flaky JUnit4 tests Sub-task Resolved Attila Doroszlai  
          64.
          Rebalance acceptance tests Sub-task Resolved Attila Doroszlai  
          65.
          Count total number of failures in flaky-test-check Sub-task Resolved Attila Doroszlai  
          66.
          Do not run all checks for draft PR touching CI files Sub-task Resolved Attila Doroszlai  
          67.
          Resolve dependabot build issues when updating npm packages Sub-task Resolved Abhishek Pal  
          68.
          Create workflow for running acceptance test repeatedly Sub-task Resolved Attila Doroszlai  
          69.
          Skip coverage check for dependabot Sub-task Resolved Attila Doroszlai  
          70.
          Skip push build for dependabot Sub-task Resolved Attila Doroszlai  
          71.
          Use secure cluster for upgrade acceptance tests Sub-task Resolved Attila Doroszlai  
          72.
          Check that dependencies have acceptable license Sub-task Resolved Attila Doroszlai  
          73.
          Set descriptive title for upgrade Robot tests Sub-task Resolved Attila Doroszlai  
          74.
          Create stack dump if cannot remove docker network Sub-task Resolved Attila Doroszlai  
          75.
          Checkstyle check passing despite config error Sub-task Resolved Attila Doroszlai  
          76.
          Selective checks: skip tests for IntelliJ config change Sub-task Resolved Attila Doroszlai  
          77.
          Split parts of misc suite Sub-task Resolved Attila Doroszlai  
          78.
          Intermittent failure at hadoop-maven-plugin in compile check Sub-task Resolved Attila Doroszlai  
          79.
          Simplify Maven cache for dependencies in CI Sub-task Resolved Attila Doroszlai  
          80.
          Let junit.sh fail if no tests were matched in repeated run Sub-task Resolved Attila Doroszlai  
          81.
          Pre-build Ozone in flaky-test-check Sub-task Resolved Attila Doroszlai  
          82.
          Collect logs from cancelled integration test Sub-task Resolved Attila Doroszlai  
          83.
          Enable Surefire process checker Sub-task Resolved Attila Doroszlai  
          84.
          Remove validation of test method in flaky-test-check Sub-task Resolved Attila Doroszlai  
          85.
          Node.js 16 actions are deprecated Sub-task Resolved Attila Doroszlai

          100%

          Original Estimate - Not Specified Original Estimate - Not Specified
          Time Spent - 1h
          86.
          Increase timeout for compile check Sub-task Resolved Attila Doroszlai  
          87.
          Group dependabot PRs for frontend dependencies Sub-task Resolved Attila Doroszlai  
          88.
          Download zlib fails with 403 Forbidden in CI Sub-task Resolved Attila Doroszlai  
          89.
          Intermittent failure downloading from sourceware.org Sub-task Resolved Attila Doroszlai  
          90.
          Add check for dependencies included in Ozone FS shaded jar Sub-task Open Unassigned  
          91.
          No logs if pod stuck in PodInitializing Sub-task Open Szabolcs Gál  
          92.
          Consistently enable exit-on-error in test scripts Sub-task Open Unassigned  
          93.
          Test Ozone with Ratis "snapshot" Sub-task Open Attila Doroszlai  
          94.
          Allow running nested test classes in flaky-test-check Sub-task Resolved Attila Doroszlai  
          95.
          Intermittent failure in compile (macos-12) Sub-task Open Unassigned  
          96.
          Selective checks: skip tests for Dashboard changes Sub-task Resolved Ritesh Shukla  
          97.
          Schedule dependabot for weekend Sub-task Resolved Attila Doroszlai  
          98.
          Populate dependency cache in separate workflow Sub-task Resolved Attila Doroszlai  
          99.
          Fetch dependencies using actual build Sub-task Resolved Attila Doroszlai  
          100.
          Repeated tests stop if fork does not return Sub-task Patch Available Attila Doroszlai  
          101.
          Downgrade Surefire to 3.0.0-M4 Sub-task Resolved Attila Doroszlai  
          102.
          Checkstyle summary excludes errors with "xml" Sub-task Resolved Attila Doroszlai  
          103.
          Exclude proto3 classes from coverage Sub-task Resolved Attila Doroszlai  
          104.
          Rename unit check to native Sub-task Resolved Attila Doroszlai  
          105.
          Remove custom dependabot workflow Sub-task Resolved Attila Doroszlai  
          106.
          Only bump npm packages for security Sub-task Open Attila Doroszlai  

          Activity

            People

              adoroszlai Attila Doroszlai
              adoroszlai Attila Doroszlai
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 1h
                  1h