Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
1.17.0
-
None
Description
Currently, regressions are detected by a simple script which may have false positives and false negatives, especially for benchmarks with small absolute values, small value changes would cause large percentage changes. see here for details.
And all benchmarks are executed on one physical machine, it might happen that hardware issues affect performance, like "FLINK-18614 Performance regression 2020.07.13".
This ticket aims to improve the precision and recall of the regression-check script.
Attachments
Issue Links
- causes
-
FLINK-31561 flink-master-regression-check is failing since March 15, 2023
- Closed
- is a parent of
-
FLINK-27571 Recognize "less is better" benchmarks in regression detection script
- Closed
- links to