Description
Proposed Behaviour
After running regression tests we should automatically check the geomean of various run-time & QoR metrics to verify they are within the expected valid ranges.
Current Behaviour
Currently the regression test scripts check QoR ranges on a per-benchmark basis.
Since all of these metrics have some noise which can cause spurious failures the per-benchmark ranges are set relatively loose.
Possible Solution
We should extend parse_vtr_task.pl to produce geomean metric values (in addition to the per-benchmark results generated in parse_results.txt).
It should also be extended to then check that these geomean values are within the expected ranges and cause an error if not.
Currently parse_vtr_task.pl produces a file called qor_results.txt
its not clear what this does and how it differs from the parse_results.txt
or whether it could be used/changed to support this feature. If not clear why its there it could be removed.
Context
We should perform more stringent automated QoR checking to catch potential errors/bugs/degredations sooner.
A related issue is #371.
Loosely related #545.