
LLVM_ENABLE_RUNTIMES=flang-rt for flang-runtime-cuda-* #393

Merged
merged 1 commit into llvm:main from flang_runtime_flang-runtime-cuda
Mar 6, 2025

Conversation

Meinersbur
Member

@Meinersbur Meinersbur commented Feb 25, 2025

Add depends_on_projects=['flang-rt'] to the flang-runtime-cuda-gcc and flang-runtime-cuda-clang builders. This prepares for the removal of the "projects" build of the flang runtime in llvm/llvm-project#124126.

Split off from #333
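The change itself is small. In llvm-zorg, builders are declared as Python dictionaries; the sketch below illustrates the shape of the updated entry (the builder name is real, but the field names and surrounding structure are illustrative, not the actual zorg code):

```python
# Hypothetical sketch of a builder entry in llvm-zorg's buildbot
# configuration; field names are illustrative placeholders.
builder = {
    "name": "flang-runtime-cuda-gcc",
    "tags": ["flang", "runtimes"],
    # 'flang-rt' is added so changes to the runtime sources trigger
    # this builder once the "projects" build of the runtime is removed.
    "depends_on_projects": ["llvm", "clang", "flang", "flang-rt"],
    # The runtime is now built via the top-level runtimes/CMakeLists.txt
    # instead of flang/runtime/CMakeLists.txt.
    "enable_runtimes": ["flang-rt"],
}
```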

Affected builders:

  • flang-runtime-cuda-gcc
    This previously built only the runtime, using flang/runtime/CMakeLists.txt as the top-level CMakeLists.txt. This is going to be replaced with the "standalone runtimes build", which uses runtimes/CMakeLists.txt as its top level. It still needs Flang to build successfully first, hence it is replaced with a bootstrap build in which the FLANG_RT_* options are forwarded internally to the runtimes build.
  • flang-runtime-cuda-clang
    This is a manual bootstrapping build which first compiles Clang, then the runtime out-of-tree. It is replaced with a standalone runtimes build as described above. Because it needs Flang, Flang is also added to the enabled projects of the stage-1 build.

Neither build runs the check-* targets, probably because the runtime unit tests require actual CUDA hardware, which the builders lack.
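For context, the "standalone runtimes build" described above is driven by the LLVM_ENABLE_RUNTIMES mechanism, which forwards FLANG_RT_*-prefixed cache entries to the runtimes sub-build. The sketch below assembles such a configure command line as a Python list; the paths and the FLANG_RT_EXAMPLE_OPTION entry are placeholders, not the builders' actual options:

```python
# Sketch (not the builder's actual script): assemble a CMake command
# line for a bootstrap build in which LLVM_ENABLE_RUNTIMES=flang-rt
# forwards FLANG_RT_* options to the runtimes sub-build.
def cmake_configure_args(source_dir, build_dir, cache_entries):
    """Return a cmake argv list for a Ninja configure step."""
    args = ["cmake", "-G", "Ninja", "-S", source_dir, "-B", build_dir]
    args += [f"-D{key}={value}" for key, value in cache_entries.items()]
    return args

args = cmake_configure_args(
    "llvm-project/llvm",
    "build",
    {
        "LLVM_ENABLE_PROJECTS": "clang;flang",  # flang itself is needed first
        "LLVM_ENABLE_RUNTIMES": "flang-rt",     # standalone runtimes build
        # FLANG_RT_* entries are forwarded to the runtimes sub-build;
        # this exact option name is a hypothetical placeholder.
        "FLANG_RT_EXAMPLE_OPTION": "ON",
    },
)
```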

Affected workers:

  • as-builder-7

Admins listed for those workers:

@Meinersbur Meinersbur force-pushed the flang_runtime_flang-runtime-cuda branch from c424475 to 0097624 Compare February 26, 2025 08:34
@Meinersbur Meinersbur marked this pull request as ready for review February 26, 2025 08:34
@vzakhari
Contributor

Thank you, Michael! For the flang-runtime-cuda-gcc build, does it imply that flang will always be rebuilt? If yes, then we will need to decide what to do about the increased build time. I think there is some limit, but I do not know for sure.

@Meinersbur
Member Author

Meinersbur commented Feb 26, 2025

Thank you, Michael! For the flang-runtime-cuda-gcc build, does it imply that flang will always be rebuilt?

Yes, but it uses ccache, so rebuilds usually don't take the maximum time.

If yes, then we will need to decide what to do about the increased build time. I think there is some limit, but I do not know for sure.

LLVM's Buildbot is configured without an absolute timeout. There are much slower workers on labs.llvm.org, taking up to 16 hours to build.

If as-builder-7 becomes too slow¹, I can combine flang-runtime-cuda-gcc and flang-runtime-cuda-clang so that Flang is built only once. For the Polly builders, I tricked them into using the same ccache cache for all builders.

Footnotes

  1. as-builder-7 is also building llvm-nvptx-nvidia-ubuntu and llvm-nvptx64-nvidia-ubuntu.
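Sharing one ccache cache between builders is usually just a matter of pointing CCACHE_DIR at a common location in each builder's environment. A minimal sketch, assuming a hypothetical shared directory path:

```python
import os

# Point every builder at one shared ccache directory (the path here is
# hypothetical) so a Flang object compiled by one builder can be
# reused by the others.
SHARED_CCACHE_DIR = "/var/cache/ccache-shared"

def builder_environment(base_env=None):
    """Return a build-step environment that uses the shared ccache."""
    env = dict(base_env if base_env is not None else os.environ)
    env["CCACHE_DIR"] = SHARED_CCACHE_DIR
    return env

env = builder_environment({"PATH": "/usr/bin"})
```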

@vzakhari
Contributor

Thanks for the explanation, Michael! It looks good to me, but I am not the one to approve it.

@Meinersbur Meinersbur requested a review from vvereschaka March 4, 2025 17:01
Contributor

@vvereschaka vvereschaka left a comment


LGTM

@Meinersbur Meinersbur merged commit 17c126f into llvm:main Mar 6, 2025
2 checks passed
Meinersbur added a commit to llvm/llvm-project that referenced this pull request Mar 26, 2025
The production buildbot master apparently has not yet been restarted
since llvm/llvm-zorg#393 landed.

This reverts commit 96d1bae.
llvm-sync bot pushed a commit to arm/arm-toolchain that referenced this pull request Mar 26, 2025
The production buildbot master apparently has not yet been restarted
since llvm/llvm-zorg#393 landed.

This reverts commit 96d1bae.
@Meinersbur
Member Author

@gkistanova Landing llvm/llvm-project#124126 depends on this configuration update becoming active. Could you restart https://lab.llvm.org/buildbot? The last restart was Feb 12.

@Meinersbur
Member Author

@gkistanova After the buildbot master restart, as-builder-7 behaves strangely. Could you have a look?

@gkistanova
Contributor

I do not think anything is wrong with the worker.

We are still investigating, but it seems the failures are related to recent changes to flang-rt.

@Meinersbur
Member Author

Meinersbur commented Apr 2, 2025

It is reporting connection failures and progress timeouts ("command timed out: 1200 seconds without output running") during the build-flang-rt/build-flang-default step, before even building flang-rt. The only explanation I have is that, without the flang-rt targets being built in between, more of the memory-intensive flang compilation jobs run concurrently (from the up to two concurrent builds the worker is configured for), leading to swapping and an eventual denial of service.

@vvereschaka
Contributor

I tried to figure out the source of that situation, and all I found is that the flang-related components have become extremely resource-hungry during the build. Currently both flang-runtime-cuda-* builders have a 99% chance of freezing the build host, even when started alone without concurrency with other builds.
A single build with 64 threads instead of 128 prevents the host from freezing, but the host also becomes underutilized with that configuration. It would be good to reduce the resource (memory?) consumption when building the flang parts.

Here are the build timings for the gcc build (16 threads):
gcc-build-tracing-core.json
gcc-build-tracing-runtime.json
(to see the graph, open chrome://tracing/ in Chrome and load the JSON file)
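Besides chrome://tracing, these files can be inspected directly: they are in the Chrome trace-event format, a JSON list of events whose "dur" field holds the duration in microseconds. A small sketch that ranks compile steps by duration; the sample events below are synthetic, not taken from the attached files:

```python
import json

# Synthetic sample in Chrome trace-event format; the real inputs would
# be gcc-build-tracing-core.json / gcc-build-tracing-runtime.json.
sample = json.loads("""
{"traceEvents": [
  {"name": "flang/lib/Lower/Bridge.cpp", "ph": "X", "ts": 0, "dur": 900000000},
  {"name": "llvm/lib/IR/Core.cpp",       "ph": "X", "ts": 0, "dur": 120000000}
]}
""")

def slowest_compiles(trace, top=10):
    """Return (seconds, name) pairs for the longest 'complete' (ph=X) events."""
    events = [e for e in trace["traceEvents"] if e.get("ph") == "X"]
    ranked = sorted(events, key=lambda e: e["dur"], reverse=True)
    return [(e["dur"] / 1e6, e["name"]) for e in ranked[:top]]

ranking = slowest_compiles(sample)
```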

The most time consuming files are flang related as far as I noticed.

[image: build-time chart of the most time-consuming compilation steps]

I have updated the builder configurations to use fewer threads and only one concurrent build: #424. It should help for now, but I hope we will be able to get back to at least two concurrent builds on the worker later.

@Meinersbur
Member Author

Meinersbur commented Apr 3, 2025

Compiling flang taking an unusual amount of memory is a known issue and will not change without moving away from its template-centric architecture. llvm/llvm-project#127364 introduced a way to limit the number of flang compile jobs specifically, but note that those jobs will then no longer fall under the LLVM_PARALLEL_COMPILE_JOBS limit.
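The general idea behind such a job limit is to cap compile parallelism by available RAM rather than by core count, so memory-hungry translation units cannot oversubscribe the host. A hedged sketch of that heuristic; the GiB-per-job figures are assumptions for illustration, not measured values for flang:

```python
import os

def max_parallel_jobs(total_mem_gib, mem_per_job_gib=4.0, cores=None):
    """Cap parallelism by whichever of cores or memory runs out first."""
    cores = cores if cores is not None else os.cpu_count() or 1
    by_memory = max(1, int(total_mem_gib // mem_per_job_gib))
    return min(cores, by_memory)

# A 128-thread host with 256 GiB RAM and memory-hungry compile jobs
# (assumed 8 GiB each) should run at most 32 such jobs at once.
jobs = max_parallel_jobs(256, mem_per_job_gib=8.0, cores=128)
```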

@vvereschaka
Contributor

Compiling flang taking an unusual amount of memory is a known issue and will not change without moving away from its template-centric architecture

Oh, I see. OK, I'll play with FLANG_PARALLEL_COMPILE_JOBS, thank you for the pointer. It may help load the build host more optimally.
