test(IDX): only enable `flaky = True` for tests that are >= 1% flaky #4325

basvandijk · 2025-03-11T16:52:20Z

Tests marked with `flaky = True` are not cached by Bazel, which means they run for every PR regardless of whether they ran successfully before. This hurts CI performance. Therefore we only mark a test as flaky if it was actually at least 1% flaky over the last month.

We also remove flaky = False settings since that's the default.
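As an illustration of the rule above (not the script actually used for this PR), here is a minimal Python sketch of the 1% cutoff. The `TestStats` record, its field names, and the idea of counting "flaky runs" per test are assumptions made for the example; how flakiness was really measured is not stated here.

```python
from dataclasses import dataclass

# 1% cutoff taken from the PR title; everything else below is hypothetical.
FLAKY_THRESHOLD = 0.01


@dataclass
class TestStats:
    label: str       # hypothetical Bazel test label
    runs: int        # total runs observed in the time window
    flaky_runs: int  # runs counted as flaky (definition is an assumption)


def flakiness_rate(stats: TestStats) -> float:
    """Fraction of runs that were flaky; 0.0 when the test never ran."""
    if stats.runs == 0:
        return 0.0
    return stats.flaky_runs / stats.runs


def should_mark_flaky(stats: TestStats, threshold: float = FLAKY_THRESHOLD) -> bool:
    """Keep `flaky = True` only when the rate reaches the threshold."""
    return flakiness_rate(stats) >= threshold
```

Under this sketch, a test with 3 flaky runs out of 200 (1.5%) would keep `flaky = True`, while one with 1 flaky run out of 200 (0.5%) would lose it and fall back to the default.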

rs/sns/integration_tests/BUILD.bazel

github-actions

If this pull request affects the behavior of any canister owned by
the Governance team, remember to update the corresponding
unreleased_changes.md file(s).

To acknowledge this reminder (and unblock the PR), dismiss this
code review by going to the bottom of the pull request page, and
supply one of the following reasons:

Done.
No canister behavior changes.

No canister behavior changes.

mbjorkqvist · 2025-03-11T18:14:09Z

Thanks for your relentless work on improving our testing, and in particular the flakiness @basvandijk! Do you have some concrete numbers or estimates on the impact on CI performance? Looking at the issue from another PoV, this will potentially increase the time that developers need to spend on PRs, either looking into the logs of failed (now non-flaky) tests to figure out if they were caused by their changes, or alternatively simply clicking retry once or twice (possibly with some delay) before deciding to investigate further? Yet another viewpoint is that some of the tests that are now being marked non-flaky are actually not flaky (anymore), with the rare failures being due to e.g., unrelated transient infrastructure issues, in which case removing the flaky flag definitely makes sense.

basvandijk · 2025-03-11T18:44:34Z

@mbjorkqvist

> Do you have some concrete numbers or estimates on the impact on CI performance?

No, I have neither concrete numbers nor an estimate. It's hard to predict what effect this will have in practice.

> Looking at the issue from another PoV, this will potentially increase the time that developers need to spend on PRs, either looking into the logs of failed (now non-flaky) tests to figure out if they were caused by their changes, or alternatively simply clicking retry once or twice (possibly with some delay) before deciding to investigate further?

True, which is why we limited the removal of `flaky = True` to those tests with a flakiness rate of < 1%.

> Yet another viewpoint is that some of the tests that are now being marked non-flaky are actually not flaky (anymore), with the rare failures being due to e.g., unrelated transient infrastructure issues, in which case removing the flaky flag definitely makes sense.

Yes, the vast majority of tests where I removed `flaky = True` had a flakiness rate of 0%.

Note that we will extend Superset with some new charts that show the tests that were run as part of multiple attempts of a single workflow (i.e. those workflows that were manually retried) and that had both more than 0 failures and more than 0 successes, i.e. flaky tests.
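The criterion described above (a test that both failed and succeeded across multiple attempts of a single workflow) can be sketched in a few lines of Python. This is only an illustration under assumed inputs: the tuple layout `(workflow_id, attempt, test_label, passed)` and the function name are hypothetical and say nothing about how the Superset charts will actually be built.

```python
from collections import defaultdict


def find_flaky_tests(results):
    """Return sorted labels of tests that, within one workflow, ran in more
    than one attempt and had at least one failure and at least one success.

    `results` is an iterable of (workflow_id, attempt, test_label, passed)
    tuples; this record layout is an assumption made for the example.
    """
    outcomes = defaultdict(lambda: {"ok": 0, "fail": 0, "attempts": set()})
    for workflow_id, attempt, label, passed in results:
        entry = outcomes[(workflow_id, label)]
        entry["ok" if passed else "fail"] += 1
        entry["attempts"].add(attempt)

    flaky = {
        label
        for (_workflow_id, label), entry in outcomes.items()
        if len(entry["attempts"]) > 1 and entry["ok"] > 0 and entry["fail"] > 0
    }
    return sorted(flaky)
```

A test that fails on every attempt is not reported by this sketch, since consistent failure points to a real breakage rather than flakiness.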

test(IDX): only enable flaky = True for tests that are >= 1% flaky

938757d

basvandijk added the CI_ALL_BAZEL_TARGETS label (runs all bazel targets and uploads them to S3) Mar 11, 2025

github-actions bot added the test label Mar 11, 2025

nmattia approved these changes Mar 11, 2025

fix dates

bd393b2

basvandijk removed the CI_ALL_BAZEL_TARGETS label (runs all bazel targets and uploads them to S3) Mar 11, 2025

basvandijk marked this pull request as ready for review March 11, 2025 17:50

basvandijk requested review from a team as code owners March 11, 2025 17:50

github-actions bot requested changes Mar 11, 2025

github-actions bot previously requested changes Mar 11, 2025

github-actions bot added @sdk @execution @consensus @node @nns-team @ic-message-routing-owners @finint labels Mar 11, 2025

github-actions bot added @crypto-team @cross-chain-team @boundary-node @research @idx @pocket-ic labels Mar 11, 2025

blind-oracle approved these changes Mar 11, 2025

marko-k0 approved these changes Mar 11, 2025

mraszyk approved these changes Mar 11, 2025

mbjorkqvist approved these changes Mar 11, 2025

r-birkner approved these changes Mar 11, 2025

lwshang approved these changes Mar 11, 2025

basvandijk enabled auto-merge March 11, 2025 19:20

altkdf approved these changes Mar 11, 2025

andrewbattat approved these changes Mar 11, 2025

fspreiss approved these changes Mar 11, 2025

ShuoWangNSL approved these changes Mar 12, 2025
