FLINK-27571: Recognize "less is better" benchmarks in regression detection script

Details

    Description

      Example benchmark:

      http://codespeed.dak8s.net:8000/timeline/#/?exe=5&ben=schedulingDownstreamTasks.BATCH&extr=on&quarts=on&equid=off&env=2&revs=200

       

      Proposed solution:

      I think #2 is the correct approach.
      We could modify save_jmh_result.py so that it correctly sets the 'units' and 'lessisbetter' fields of the benchmark results. The 'units' value is already contained in the JMH result, and 'lessisbetter' can be derived from the mode (false if the mode is 'thrpt', otherwise true). An example of the JMH result format can be found at https://i.stack.imgur.com/vB3fV.png.
      This would fix both the web UI and the REST result, so regression_report.py would then be able to identify which benchmarks are "less is better" and treat them differently.
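
      A minimal sketch of the rule described above, not the actual code of save_jmh_result.py or regression_report.py. The function names, the CSV column names ("Benchmark", "Mode", "Unit") and the sample rows are assumptions made for illustration; only the 'thrpt' rule and the 'units' / 'lessisbetter' field names come from this ticket.

      ```python
      import csv
      import io


      def derive_benchmark_fields(mode, unit):
          """Hypothetical helper: build 'units' and 'lessisbetter' for one JMH result."""
          # Rule proposed in this ticket: 'thrpt' (throughput) means higher is better,
          # every other mode means lower is better.
          return {"units": unit, "lessisbetter": mode != "thrpt"}


      # Made-up sample rows; the column names are an assumption about the result file.
      SAMPLE = '''"Benchmark","Mode","Score","Unit"
      "exampleThroughputBenchmark","thrpt",1234.5,"ops/ms"
      "exampleAverageTimeBenchmark","avgt",42.0,"ms/op"
      '''.replace("\n      ", "\n")


      def fields_by_benchmark(csv_text):
          """Map each benchmark name to its derived 'units' / 'lessisbetter' fields."""
          rows = csv.DictReader(io.StringIO(csv_text))
          return {
              row["Benchmark"]: derive_benchmark_fields(row["Mode"], row["Unit"])
              for row in rows
          }


      def relative_regression(baseline, current, lessisbetter):
          """Hypothetical helper: fractional change in the 'worse' direction.

          A positive return value means the benchmark got worse, whichever
          direction counts as worse for that benchmark.
          """
          change = (current - baseline) / baseline
          return change if lessisbetter else -change
      ```

      With a helper like relative_regression, the report script could apply a single threshold to all benchmarks: a score rising from 100 to 110 counts as a 10% regression when 'lessisbetter' is true, and as a 10% improvement when it is false.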

       

      Attachments

        1. image-2022-12-29-14-39-59-976.png
          802 kB
          Yanfei Lei
        2. Screenshot_2022-05-09_10-33-11.png
          104 kB
          Roman Khachatryan

            People

              Yanfei Lei
              Roman Khachatryan