Add AMDGCN option similar to `cuda-compute-capabilities` #4860

Thyre · 2025-04-25T19:22:03Z

Summary

This PR aims to implement an option similar to `cuda-compute-capabilities` (and related options) for AMD GPUs.
The option can then replace the manual handling done in some EasyBlocks, e.g. Clang & LLVM, making it possible to enable (some) GPU builds without altering the EasyConfig.

Most of the handling was copied from CUDA, while some options were skipped as they don't make much sense, e.g. cuda_cc_space_sep_no_period.
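
To illustrate the kind of per-EasyBlock handling this is meant to replace, here is a minimal sketch of how a list of AMDGCN targets could be turned into the separator-delimited strings that build systems usually expect. The helper name `format_amdgcn_targets` and the dictionary keys are hypothetical and not taken from this PR; only the idea of mirroring the CUDA variants comes from the description above.

```python
def format_amdgcn_targets(targets):
    """Return separator-joined variants of a list of AMDGCN targets.

    Hypothetical helper for illustration only; the key names are assumptions.
    """
    return {
        'space_sep': ' '.join(targets),
        'comma_sep': ','.join(targets),
        'semicolon_sep': ';'.join(targets),
    }


values = format_amdgcn_targets(['gfx90a', 'gfx1100'])
print(values['semicolon_sep'])  # gfx90a;gfx1100
```

Since names like gfx90a contain no period, a "no period" variant would have nothing to strip, which fits the decision to skip it.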

The regex used should support all GPU architectures starting from gfx600, including the more recent generic targets.
Actual support for a given architecture then needs to be present in the compiler consuming it. Both GCC and LLVM accept the same naming, i.e. gfx[...], including generic targets.

Missing features compared to CUDA

- A counterpart to the cuda_cache_dir option is missing. I haven't found something similar for HIP yet, but may simply have missed it.
- "int only" options are missing; they are hard to provide with generic targets and targets like gfx90a.
  - Maybe a target without gfx?

More to be determined.

Known issues

The regex for generic targets is not perfect: it lets e.g. gfx10--generic pass, even though that is not a valid target name.

Resolves #4829

Thyre · 2025-05-10T13:04:49Z

I've started to create a set of EasyConfig & EasyBlock changes to test the option, starting with LLVM & CMake...
The next logical step would be to build some HIP application with CMake, and maybe try something more special like AdaptiveCpp. I'll use a system ROCm for this, but in the end, everything should also work with an EB-built ROCm.

Let's see if this works the way I expect.

https://github.com/Thyre/easybuild-custom/tree/support-passing-amdgcn

Micket

lgtm

I really don't have any hardware to test any of this on. I trust you have tested this quite a bit?

Micket · 2025-07-15T14:15:34Z

We are hitting rate limits (again?)
We need to rethink those framework tests. There are a bunch of issues like this:

ERROR: test_fetch_easyconfigs_from_commit (test.framework.github.GithubTest)
Test fetch_easyconfigs_from_commit function.
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/tmp/runner/a2a179c7e5ef5c3a44bda1281d10211f5940d494/lib/python3.8/site-packages/test/framework/github.py", line 561, in test_fetch_easyconfigs_from_commit
    res = fetch_easyconfigs_from_commit(test_commit)
  File "/tmp/runner/a2a179c7e5ef5c3a44bda1281d10211f5940d494/lib/python3.8/site-packages/easybuild/tools/github.py", line 807, in fetch_easyconfigs_from_commit
    return fetch_files_from_commit(commit, files=files, path=path, github_repo=GITHUB_EASYCONFIGS_REPO)
  File "/tmp/runner/a2a179c7e5ef5c3a44bda1281d10211f5940d494/lib/python3.8/site-packages/easybuild/tools/github.py", line 748, in fetch_files_from_commit
    raise EasyBuildError(error_msg, exit_code=EasyBuildExit.FAIL_GITHUB)
easybuild.tools.build_log.EasyBuildError: 'Failed to download diff for easybuilders/easybuild-easyconfigs commit 6515b44cd84a20fe7876cb4bdaf3c0080e688566! (HTTP Error 403: rate limit exceeded)'

Thyre · 2025-07-15T14:30:02Z

> lgtm
>
> I really don't have any hardware to test any of this on. I trust you have tested this quite a bit?

I've basically used this to build all of the ROCm software that I'm trying to bring to EasyBuild (after my vacation), on two separate machines.

You'll find quite a few test reports from my Arch Linux machine (or jrc0850) with this option set in the configuration.

Some test reports:

Moving openmp to a runtime in newer versions of LLVM easybuild-easyblocks#3799 (comment)
{vis}[GCCcore/14.2.0] LLVM v20.1.7, Mesa v25.1.3, lit v18.1.8, ... easybuild-easyconfigs#23144 (comment)
{vis}[GCCcore/14.2.0] LLVM v20.1.7, Mesa v25.1.3, lit v18.1.8, ... easybuild-easyconfigs#23144 (comment)

What I haven't explicitly tested (again) is using the generic targets, also because they're still quite new in ROCm.
Let me test that, and also check that explicitly passing nothing works as expected (to ensure that e.g. LLVM 19 works with `gfx1201` in the config file). That will have to wait until next week though.

Micket · 2025-07-15T19:34:55Z

OK, so I'll let you also test that before merging then? I'll also be away traveling after this week, so if anyone else wants to hit merge please go ahead.

Thyre · 2025-07-15T19:45:54Z

Yeah, I'll test those things once I'm back home. If everything works, I'll ping in our merge-sprint channel 😄

Crivella · 2025-07-23T08:40:12Z

easybuild/tools/options.py

+            amdgcn_cc_regex = re.compile(r'gfx[0-9]+[a-z]?$')
+            # Generic convention.
+            # Regex is not perfect, as it doesn't catch gfx[...]--generic
+            amdgcn_generic_regex = re.compile(r'gfx[0-9]+[-]?[0-9]?-generic$')


Should the -NUMBER be in a group? E.g.:

Suggested change

-            amdgcn_generic_regex = re.compile(r'gfx[0-9]+[-]?[0-9]?-generic$')
+            amdgcn_generic_regex = re.compile(r'gfx[0-9]+(\-[0-9])?-generic$')

At least among the LLVM 20.1.7 targets, I don't see any --generic ones without the number in between:

    crivella@crivella-desktop:~$ llc -march=amdgcn -mattr=help
    Available CPUs for this target:

      bonaire - Select the bonaire processor.
      carrizo - Select the carrizo processor.
      fiji - Select the fiji processor.
      generic - Select the generic processor.
      generic-hsa - Select the generic-hsa processor.
      gfx10-1-generic - Select the gfx10-1-generic processor.
      gfx10-3-generic - Select the gfx10-3-generic processor.
      gfx1010 - Select the gfx1010 processor.
      gfx1011 - Select the gfx1011 processor.
      gfx1012 - Select the gfx1012 processor.
      gfx1013 - Select the gfx1013 processor.
      gfx1030 - Select the gfx1030 processor.
      gfx1031 - Select the gfx1031 processor.
      gfx1032 - Select the gfx1032 processor.
      gfx1033 - Select the gfx1033 processor.
      gfx1034 - Select the gfx1034 processor.
      gfx1035 - Select the gfx1035 processor.
      gfx1036 - Select the gfx1036 processor.
      gfx11-generic - Select the gfx11-generic processor.
      gfx1100 - Select the gfx1100 processor.
      gfx1101 - Select the gfx1101 processor.
      gfx1102 - Select the gfx1102 processor.
      gfx1103 - Select the gfx1103 processor.
      gfx1150 - Select the gfx1150 processor.
      gfx1151 - Select the gfx1151 processor.
      gfx1152 - Select the gfx1152 processor.
      gfx1153 - Select the gfx1153 processor.
      gfx12-generic - Select the gfx12-generic processor.
      gfx1200 - Select the gfx1200 processor.
      gfx1201 - Select the gfx1201 processor.
      gfx600 - Select the gfx600 processor.
      gfx601 - Select the gfx601 processor.
      gfx602 - Select the gfx602 processor.
      gfx700 - Select the gfx700 processor.
      gfx701 - Select the gfx701 processor.
      gfx702 - Select the gfx702 processor.
      gfx703 - Select the gfx703 processor.
      gfx704 - Select the gfx704 processor.
      gfx705 - Select the gfx705 processor.
      gfx801 - Select the gfx801 processor.
      gfx802 - Select the gfx802 processor.
      gfx803 - Select the gfx803 processor.
      gfx805 - Select the gfx805 processor.
      gfx810 - Select the gfx810 processor.
      gfx9-4-generic - Select the gfx9-4-generic processor.
      gfx9-generic - Select the gfx9-generic processor.
      gfx900 - Select the gfx900 processor.
      gfx902 - Select the gfx902 processor.
      gfx904 - Select the gfx904 processor.
      gfx906 - Select the gfx906 processor.
      gfx908 - Select the gfx908 processor.
      gfx909 - Select the gfx909 processor.
      gfx90a - Select the gfx90a processor.
      gfx90c - Select the gfx90c processor.
      gfx940 - Select the gfx940 processor.
      gfx941 - Select the gfx941 processor.
      gfx942 - Select the gfx942 processor.
      gfx950 - Select the gfx950 processor.
      hainan - Select the hainan processor.
      hawaii - Select the hawaii processor.
      iceland - Select the iceland processor.
      kabini - Select the kabini processor.
      kaveri - Select the kaveri processor.
      mullins - Select the mullins processor.
      oland - Select the oland processor.
      pitcairn - Select the pitcairn processor.
      polaris10 - Select the polaris10 processor.
      polaris11 - Select the polaris11 processor.
      stoney - Select the stoney processor.
      tahiti - Select the tahiti processor.
      tonga - Select the tonga processor.
      tongapro - Select the tongapro processor.
      verde - Select the verde processor.

Also not sure if we want to limit the possible number of hits for the first number based on what follows, e.g.:

    rgx1 = re.compile(r'gfx[0-9]{3,4}')
    rgx2 = re.compile(r'gfx[0-9]{2,3}[a-z]')
    rgx3 = re.compile(r'gfx[0-9]{1,2}(\-[0-9])?\-generic')
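
As a quick sanity check of the three stricter patterns proposed here, the following sketch runs them against a handful of names from the llc listing in this comment. The helper `is_known_shape` is hypothetical. Because the patterns as posted carry no end anchor, the sketch uses `fullmatch` so that trailing characters are not silently accepted.

```python
import re

# Patterns as proposed in the review comment.
rgx1 = re.compile(r'gfx[0-9]{3,4}')
rgx2 = re.compile(r'gfx[0-9]{2,3}[a-z]')
rgx3 = re.compile(r'gfx[0-9]{1,2}(\-[0-9])?\-generic')


def is_known_shape(target):
    """Return True if the whole string matches one of the three patterns."""
    return any(rgx.fullmatch(target) for rgx in (rgx1, rgx2, rgx3))


# A few names taken from the llc listing.
accepted = ['gfx600', 'gfx1201', 'gfx90a', 'gfx9-generic', 'gfx10-3-generic']
# Strings that should not pass, including the double-dash case.
rejected = ['gfx10--generic', 'gfx10-1', 'gfx90', 'tahiti']

print(all(is_known_shape(t) for t in accepted))  # True
print(any(is_known_shape(t) for t in rejected))  # False
```

The trade-off is that fixed digit counts would need updating if a future target falls outside these ranges.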

Thyre · 2025-07-23

I wouldn't expect to see --generic at all. We should treat this as an invalid pattern.
My regex knowledge is limited in that regard though, so any better idea for catching this is appreciated 😄

> Also not sure if we want to limit the possible number of hits for the first number based on what follows

Hm, I'd probably leave this a bit more generic, to make sure that we don't have to update this regularly. I wouldn't expect AMD to add generic targets for something like gfx600, but who knows what will be introduced in the future. Our check for cuda-compute-capabilities is also fairly generic.

Crivella · 2025-07-23

If --generic is never a thing, I think having them grouped is the way to go:

    >>> import re
    >>> rgx = re.compile(r'gfx[0-9]+(\-[0-9])?-generic$')
    >>> correct = ['gfx10-1-generic', 'gfx10-3-generic', 'gfx11-generic', 'gfx12-generic', 'gfx9-4-generic', 'gfx9-generic']
    >>> wrong = ['gfx10-1', 'gfx10--generic']
    >>> [rgx.match(_) for _ in correct]
    [<re.Match object; span=(0, 15), match='gfx10-1-generic'>, <re.Match object; span=(0, 15), match='gfx10-3-generic'>, <re.Match object; span=(0, 13), match='gfx11-generic'>, <re.Match object; span=(0, 13), match='gfx12-generic'>, <re.Match object; span=(0, 14), match='gfx9-4-generic'>, <re.Match object; span=(0, 12), match='gfx9-generic'>]
    >>> [rgx.match(_) for _ in wrong]
    [None, None]

If you do it without grouping, --generic would also be accepted:

    >>> rgx = re.compile(r'gfx[0-9]+[-]?[0-9]?-generic$')
    >>> [rgx.match(_) for _ in wrong]
    [None, <re.Match object; span=(0, 14), match='gfx10--generic'>]
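
Putting the pieces of this thread together, a validator for a comma-separated option value could look roughly like the sketch below. It reuses the exact-target regex from the diff and the grouped generic regex suggested in this thread. The function name `parse_amdgcn_capabilities` and the use of `ValueError` are assumptions for illustration, not the code merged in this PR. Treating an empty string as an empty list is likewise an assumption, loosely modelled on the "Allow users to pass empty AMDGCN list" commit.

```python
import re

# Exact targets (gfx90a, gfx1100, ...) and generic targets with the
# optional "-N" part kept in a group, as discussed in the review.
AMDGCN_CC_REGEX = re.compile(r'gfx[0-9]+[a-z]?$')
AMDGCN_GENERIC_REGEX = re.compile(r'gfx[0-9]+(-[0-9])?-generic$')


def parse_amdgcn_capabilities(value):
    """Split a comma-separated string and validate every target.

    Hypothetical helper; an empty string yields an empty list.
    """
    targets = [item.strip() for item in value.split(',') if item.strip()]
    invalid = [t for t in targets
               if not (AMDGCN_CC_REGEX.match(t) or AMDGCN_GENERIC_REGEX.match(t))]
    if invalid:
        raise ValueError("Invalid AMDGCN target(s): %s" % ', '.join(invalid))
    return targets


print(parse_amdgcn_capabilities('gfx90a,gfx10-3-generic'))  # ['gfx90a', 'gfx10-3-generic']
print(parse_amdgcn_capabilities(''))                        # []
```

With the grouped pattern, a value such as gfx10--generic raises an error instead of slipping through.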

Thyre commented Apr 25, 2025

easybuild/framework/easyconfig/default.py (outdated)

Thyre force-pushed the support-passing-amdgcn branch from 958ad0a to bff1bfb Compare April 25, 2025 21:39

Thyre commented Apr 25, 2025

test/framework/easyconfig.py (outdated)

Thyre force-pushed the support-passing-amdgcn branch from bff1bfb to 0e7aaf3 Compare April 25, 2025 22:55

Thyre changed the title ~~Add AMDGCN options similar to cuda-compute-capabilities~~ Add AMDGCN option similar to cuda-compute-capabilities Apr 25, 2025

Thyre mentioned this pull request Apr 25, 2025

Enhance LLVM EasyBlock to better handle offload builds easybuilders/easybuild-easyblocks#3675

Merged

boegel added the enhancement label May 7, 2025

boegel added this to the 5.x milestone May 7, 2025

Thyre force-pushed the support-passing-amdgcn branch from 0e7aaf3 to d4ba387 Compare May 10, 2025 12:14

Thyre mentioned this pull request Jun 4, 2025

Move Clang easyconfigs to use LLVM easyblock easybuilders/easybuild-easyconfigs#23028

Closed

Thyre force-pushed the support-passing-amdgcn branch 2 times, most recently from 9f3fc25 to 5a82798 Compare June 19, 2025 13:15

This was referenced Jul 3, 2025

Draft: New EasyBlock for ROCm-LLVM easybuilders/easybuild-easyblocks#3823

Draft

LLVM: Adopt build option amdgcn_capabilities easybuilders/easybuild-easyblocks#3824

Open

{tools}[GCCcore/14.2.0] ROCm-LLVM v6.4.1 easybuilders/easybuild-easyconfigs#23304

Draft

Thyre marked this pull request as ready for review July 3, 2025 19:04

Thyre force-pushed the support-passing-amdgcn branch from 5a82798 to db9a681 Compare July 3, 2025 19:04

Thyre and others added 4 commits July 7, 2025 06:28

Add AMDGCN compute capability option

fe88dcf

Signed-off-by: Jan Andre Reuter <[email protected]>

Remove compute from AMDGCN naming

052f5ee

AMD doesn't name this compute capabilities, and amdhsa is only used when lowering to HSA (but amdpal & mesa3d are also possible). Therefore, simply name the option 'amdgcn-capabilities'. Signed-off-by: Jan Andre Reuter <[email protected]>

Allow users to pass empty AMDGCN list

fcc85f1

This allows users to handle cases like LLVM, where building with GPU support is optional, but users might still want to install the software without GPU support. Signed-off-by: Jan Andre Reuter <[email protected]>

AMDGCN: Fix test typo

4af19e3

Signed-off-by: Jan André Reuter <[email protected]>

Thyre force-pushed the support-passing-amdgcn branch from db9a681 to 4af19e3 Compare July 7, 2025 04:32

Add required argument to get_amdgcn_cc_template_value

afa6558

Signed-off-by: Jan André Reuter <[email protected]>

Thyre force-pushed the support-passing-amdgcn branch from 6e32eac to afa6558 Compare July 7, 2025 04:44

Micket approved these changes Jul 15, 2025

Crivella reviewed Jul 23, 2025

Thyre mentioned this pull request Jul 23, 2025

{tools}[GCCcore/14.3.0] lit v18.1.8, LLVM v20.1.8, psutil v7.0.0, ... easybuilders/easybuild-easyconfigs#23459

Open
