ci: add pre-release performance quality gates prototype #13506

igoragoli · 2025-05-26T13:29:21Z

https://datadoghq.atlassian.net/browse/APMSP-2000

Overview

Add a prototype for pre-release performance quality gates. This prototype doesn't actually gate anything.

Motivation

Necessary to validate the approach proposed in the pre-release performance quality gates RFC.

Testing

Testing pipelines:

https://gitlab.ddbuild.io/DataDog/apm-reliability/dd-trace-py/-/pipelines/66090033 (containing extra testing jobs)
https://gitlab.ddbuild.io/DataDog/apm-reliability/dd-trace-py/-/pipelines/66116717 (most recent one)

Threshold configuration files added in: https://github.com/DataDog/benchmarking-platform/pull/157

Notifications are correctly being sent to #pre-release-gates-prototype.

Checklist

PR author has checked that all the criteria below are met
The PR description includes an overview of the change
The PR description articulates the motivation for the change
The change includes tests OR the PR description describes a testing strategy
The PR description notes risks associated with the change, if any
Newly-added code is easy to change
The change follows the library release note guidelines
The change includes or references documentation updates if necessary
Backport labels are set (if applicable)

Reviewer Checklist

Reviewer has checked that all the criteria below are met
Title is accurate
All changes are related to the pull request's stated goal
Avoids breaking API changes
Testing strategy adequately addresses listed risks
Newly-added code is easy to change
Release note makes sense to a user of the library
If necessary, author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment
Backport labels are set in a manner that is consistent with the release branch maintenance policy

github-actions · 2025-05-26T13:29:57Z

CODEOWNERS have been resolved as:

.gitlab/benchmarks/macrobenchmarks.yml                                  @DataDog/python-guild @DataDog/apm-core-python

github-actions · 2025-05-26T13:49:35Z

Bootstrap import analysis

Comparison of import times between this PR and base.

Summary

The average import time from this PR is: 270 ± 10 ms.

The average import time from base is: 270 ± 10 ms.

The import time difference between this PR and base is: -6.1 ± 0.5 ms.

Import time breakdown

The following import paths have shrunk:

ddtrace.auto 1.549 ms (0.58%)

ddtrace.bootstrap.sitecustomize 0.855 ms (0.32%)

ddtrace.bootstrap.preload 0.855 ms (0.32%)

ddtrace.internal.remoteconfig.client 0.855 ms (0.32%)

ddtrace 0.694 ms (0.26%)

pr-commenter · 2025-05-26T14:14:43Z

Benchmarks

Benchmark execution time: 2025-05-29 19:10:39

Comparing candidate commit 02f0eaa in PR branch igoragoli/add-pre-release-gates with baseline commit 520cbf5 in branch main.

Found 1 performance improvements and 1 performance regressions! Performance is the same for 506 metrics, 8 unstable metrics.

scenario:iastdjangostartup-iast

🟩 execution_time [-438.653ms; -215.568ms] or [-18.403%; -9.044%]

scenario:telemetryaddmetric-1-gauge-metric-1-times

🟥 execution_time [+157.983ns; +236.050ns] or [+7.364%; +11.002%]

erikayasuda

LGTM, just one NB question 👍

erikayasuda · 2025-05-30T15:10:20Z

.gitlab/benchmarks/macrobenchmarks.yml

+check-warning-breaches:
+  extends: .check-threshold-breaches
+  script:
+    - cd platform && (git init && git remote add origin https://gitlab-ci-token:${CI_JOB_TOKEN}@gitlab.ddbuild.io/DataDog/benchmarking-platform && git pull origin python/macrobenchmarks)
+    - bp-runner bp-runner.fail-on-breach.warning.yml


NB: I saw in the last pipeline that the macrobenchmarks pipeline was triggered and the SLO warning job failed. Initially I thought that we should be setting this as allow_fail: true so that a warning doesn't become a blocker for PRs... but then I realized that would make it silent on the main branch, which is what we're trying to alleviate 😅 If the intention of these checks is only for pre-release gating, maybe we can make this only trigger on main and release branches?

What are your thoughts? 🤔

Suggested change

check-warning-breaches:

extends: .check-threshold-breaches

script:

- cd platform && (git init && git remote add origin https://gitlab-ci-token:${CI_JOB_TOKEN}@gitlab.ddbuild.io/DataDog/benchmarking-platform && git pull origin python/macrobenchmarks)

- bp-runner bp-runner.fail-on-breach.warning.yml

check-warning-breaches:

extends: .check-threshold-breaches

only:

- main

- /^v[0-9]+\.[0-9]+\.[0-9]+$/

script:

- cd platform && (git init && git remote add origin https://gitlab-ci-token:${CI_JOB_TOKEN}@gitlab.ddbuild.io/DataDog/benchmarking-platform && git pull origin python/macrobenchmarks)

- bp-runner bp-runner.fail-on-breach.warning.yml

Hi!

so that a warning doesn't become a blocker for PRs

I've made some updates. In summary, only one check-slo-breaches job is necessary, and it'll check for warnings with a range defined on the thresholds file (bp-runner.fail-on-breach.yml). Warnings won't fail the job 👍

we can make this only trigger on main and release branches?

Exactly, benchmarks considered for gating releases should be run on main and release branches.

This is not the case for Python macrobenchmarks, which only run on schedule: https://github.com/DataDog/dd-trace-py/blob/main/.gitlab-ci.yml#L91-L94

brettlangdon

something we found, the main trigger job needs to be strategy: depend for the trigger

brettlangdon

we might want to try and do similar as the microbenchmarks which will use the artifacts from the download_ddtrace_artifacts and use the pre-built wheels instead of installing from source which takes a few minutes.

igoragoli · 2025-06-04T09:57:56Z

Closing this due to updates on how breaches are checked, and due to the example PR I've added: #13584

@erikayasuda, this example PR can be a basis for the PR you need to add for https://datadoghq.atlassian.net/browse/APMSP-2051.

igoragoli added 3 commits May 26, 2025 11:02

Add gating and notification jobs

5d4bce5

Remove testing jobs

d6c223a

Switch benchmarking-platform branch to python/macrobenchmarks

e4f537b

igoragoli requested review from a team as code owners May 26, 2025 13:29

igoragoli requested review from RamyElkest and erikayasuda May 26, 2025 13:29

github-actions bot added the backport 2.21 label May 26, 2025

igoragoli added 2 commits May 26, 2025 15:46

Remove "needs:" from threshold breach checks

095c96b

Run threshold breach checks independently of macrobenchmark failures

67c747a

emmettbutler approved these changes May 27, 2025

View reviewed changes

Merge branch 'main' into igoragoli/add-pre-release-gates

02f0eaa

erikayasuda added the changelog/no-changelog A changelog entry is not required for this PR. label May 29, 2025

erikayasuda approved these changes May 30, 2025

View reviewed changes

brettlangdon reviewed May 30, 2025

View reviewed changes

igoragoli closed this Jun 4, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ci: add pre-release performance quality gates prototype #13506

ci: add pre-release performance quality gates prototype #13506

Uh oh!

igoragoli commented May 26, 2025 •

edited by erikayasuda

Loading

Uh oh!

github-actions bot commented May 26, 2025

Uh oh!

github-actions bot commented May 26, 2025 •

edited

Loading

Uh oh!

pr-commenter bot commented May 26, 2025 •

edited

Loading

Uh oh!

erikayasuda left a comment

Uh oh!

erikayasuda May 30, 2025

Uh oh!

igoragoli Jun 4, 2025

Uh oh!

brettlangdon left a comment

Uh oh!

brettlangdon left a comment

Uh oh!

igoragoli commented Jun 4, 2025

Uh oh!

Uh oh!

ci: add pre-release performance quality gates prototype #13506

ci: add pre-release performance quality gates prototype #13506

Uh oh!

Conversation

igoragoli commented May 26, 2025 • edited by erikayasuda Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Motivation

Testing

Checklist

Reviewer Checklist

Uh oh!

github-actions bot commented May 26, 2025

Uh oh!

github-actions bot commented May 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Bootstrap import analysis

Summary

Import time breakdown

Uh oh!

pr-commenter bot commented May 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmarks

scenario:iastdjangostartup-iast

scenario:telemetryaddmetric-1-gauge-metric-1-times

Uh oh!

erikayasuda left a comment

Choose a reason for hiding this comment

Uh oh!

erikayasuda May 30, 2025

Choose a reason for hiding this comment

Uh oh!

igoragoli Jun 4, 2025

Choose a reason for hiding this comment

Uh oh!

brettlangdon left a comment

Choose a reason for hiding this comment

Uh oh!

brettlangdon left a comment

Choose a reason for hiding this comment

Uh oh!

igoragoli commented Jun 4, 2025

Uh oh!

Uh oh!

igoragoli commented May 26, 2025 •

edited by erikayasuda

Loading

github-actions bot commented May 26, 2025 •

edited

Loading

pr-commenter bot commented May 26, 2025 •

edited

Loading