Bump rustc-perf and update PGO crates #141490
@bors try @rust-timer queue
Bump rustc-perf and update PGO crates Updates rustc-perf to rust-lang/rustc-perf@8158f78, and updates the crates. r? `@Kobzol`
☀️ Try build successful - checks-actions
Finished benchmarking commit (4c4ffc4): comparison URL.

Overall result: ❌✅ regressions and improvements - please read the text below

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @bors rollup=never

Instruction count
This is the most reliable metric that we have; it was used to determine the overall result at the top of this comment. However, even this metric can sometimes exhibit noise.

Max RSS (memory usage)
Results (primary -1.6%, secondary -1.0%)
This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

Cycles
Results (primary -1.0%, secondary -8.5%)
This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

Binary size
This benchmark run did not return any relevant results for this metric.

Bootstrap: 776.203s -> 774.104s (-0.27%)
New solver perf improvements even without the crates being included is really funny 😆 I'm not missing anything here, right?
Sorry, it's the same bug 🤦 Since some of the used benchmarks are prefixes of the new solver benchmarks, the new solver benchmarks are included even though we don't want them to be. I will fix it next week. CC @nnethercote I think I will just change the behavior of |
Can we |
But we actually want to add a few -new-solver benchmarks in a follow-up 🫠 |
It's a shame to have to type the extra chars, but yeah, the false matches are really annoying. I wonder if we could allow "exact match" or "exact match but without the version number at the end". E.g. |
I thought about this a bit more, and maybe to avoid breaking existing users it would be better to just add a new flag, e.g. |
rust-lang/rustc-perf#2139 should hopefully fix this.
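For readers following along, the false-match problem described above can be sketched in a few lines of Rust. This is only an illustration under stated assumptions, not rustc-perf's actual implementation: the function names (`matches_prefix`, `matches_exact`, `strip_version`, `looks_like_version`) are hypothetical, and the "exact match, or exact match without the trailing version number" rule is one reading of the idea floated in this thread.

```rust
/// Prefix matching: a benchmark is selected if its name starts with any filter.
/// This is the behaviour that produces the false matches discussed above.
fn matches_prefix(name: &str, filters: &[&str]) -> bool {
    filters.iter().any(|f| name.starts_with(*f))
}

/// Returns true if `s` looks like a dotted numeric version such as "1.0.219".
fn looks_like_version(s: &str) -> bool {
    !s.is_empty()
        && s.split('.')
            .all(|part| !part.is_empty() && part.chars().all(|c| c.is_ascii_digit()))
}

/// Strips a trailing "-<version>" suffix if there is one,
/// e.g. "serde-1.0.219" becomes "serde". Other names are returned unchanged.
fn strip_version(name: &str) -> &str {
    match name.rsplit_once('-') {
        Some((base, suffix)) if looks_like_version(suffix) => base,
        _ => name,
    }
}

/// Hypothetical stricter rule: a filter must equal the full benchmark name,
/// or the benchmark name with its trailing version number removed.
fn matches_exact(name: &str, filters: &[&str]) -> bool {
    filters
        .iter()
        .any(|f| *f == name || *f == strip_version(name))
}
```

With the filter `serde-1.0.219` (benchmark names here are illustrative), `matches_prefix` also selects `serde-1.0.219-new-solver`, while `matches_exact` does not. The shorter filter `serde` still selects `serde-1.0.219` under the exact rule, because the trailing version number is stripped before comparing.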
Bumping rustc-perf to |
Some changes occurred in src/tools/opt-dist
cc @Kobzol
@bors try @rust-timer queue
Finished benchmarking commit (b7413ea): comparison URL.

Overall result: ❌✅ regressions and improvements - please read the text below

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @bors rollup=never

Instruction count
This is the most reliable metric that we have; it was used to determine the overall result at the top of this comment. However, even this metric can sometimes exhibit noise.

Max RSS (memory usage)
Results (primary -0.3%, secondary 1.5%)
This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

Cycles
Results (primary -1.5%, secondary -3.3%)
This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

Binary size
Results (primary -1.1%)
This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

Bootstrap: 780.121s -> 777.226s (-0.37%)
We changed both the PGO training set and the benchmarks that we measure, so we have to take what we get. The binary size regression of the toolchain is a bit annoying, but I guess there's not much to do here.
@bors r+
💔 Test failed - checks-actions
@Kobzol we’ve had the same issue when trying individual benchmarks with cargo; maybe they need [workspace] in their Cargo.toml. We used to do that when adding benchmarks before, but I don’t know if it’s been done for this round of updates?
Hmm, I wonder why distcheck tries to load these files. But yeah, this is a bug, fixed in rust-lang/rustc-perf#2144.
Ok, bumped to master once again.
Thanks!
@bors r+
☀️ Test successful - checks-actions
What is this?
This is an experimental post-merge analysis report that shows differences in test outcomes between the merged PR and its parent PR.

Comparing 13718eb (parent) -> 1bbd62e (this PR)

Test differences
No test diffs found

Test dashboard
Run

cargo run --manifest-path src/ci/citool/Cargo.toml -- \
    test-dashboard 1bbd62e547ba5cc08ccb44c27def3d33195d2dd5 --output-dir test-dashboard

And then open

Job duration changes
How to interpret the job duration changes?
Job durations can vary a lot, based on the actual runner instance
Finished benchmarking commit (1bbd62e): comparison URL.

Overall result: ❌✅ regressions and improvements - please read the text below

Our benchmarks found a performance regression caused by this PR.

Next Steps:
@rustbot label: +perf-regression

Instruction count
This is the most reliable metric that we have; it was used to determine the overall result at the top of this comment. However, even this metric can sometimes exhibit noise.

Max RSS (memory usage)
Results (primary -2.0%, secondary -2.8%)
This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

Cycles
Results (primary -1.0%, secondary -1.4%)
This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

Binary size
This benchmark run did not return any relevant results for this metric.

Bootstrap: 780.508s -> 778.428s (-0.27%)
Updates rustc-perf to rust-lang/rustc-perf@8158f78, and updates the crates.
r? @Kobzol