You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/developer_guide/build.md
+55-34Lines changed: 55 additions & 34 deletions
Original file line number
Diff line number
Diff line change
@@ -6,13 +6,14 @@ HIP code can be developed either on AMD ROCm platform using HIP-Clang compiler,
6
6
Before build and run HIP, make sure drivers and pre-build packages are installed properly on the platform.
7
7
8
8
### AMD platform
9
-
Install ROCm packages (see ROCm Installation Guide on AMD public documentation site (https://docs.amd.com/)) or install pre-built binary packages using the package manager,
9
+
Install ROCm packages (see ROCm Installation Guide on AMD public documentation site (https://docs.amd.com/) or install pre-built binary packages using the package manager,
10
10
11
11
```shell
12
12
sudo apt install mesa-common-dev
13
13
sudo apt install clang
14
14
sudo apt install comgr
15
15
sudo apt-get -y install rocm-dkms
16
+
sudo apt-get install -y libelf-dev
16
17
```
17
18
18
19
### NVIDIA platform
@@ -22,14 +23,14 @@ Install Nvidia driver and pre-build packages (see HIP Installation Guide at http
22
23
### Branch of repository
23
24
24
25
Before get HIP source code, set the expected branch of repository at the variable `ROCM_BRANCH`.
25
-
For example, for ROCm5.0 release branch, set
26
+
For example, for ROCm5.6 release branch, set
26
27
```shell
27
-
export ROCM_BRANCH=rocm-5.0.x
28
+
export ROCM_BRANCH=rocm-5.6.x
28
29
```
29
30
30
-
ROCm5.4 release branch, set
31
+
ROCm5.6 release branch, set
31
32
```shell
32
-
export ROCM_BRANCH=rocm-5.4.x
33
+
export ROCM_BRANCH=rocm-5.6.x
33
34
```
34
35
Similiar format for future branches.
35
36
@@ -42,38 +43,47 @@ Similiar format for future branches.
ROCclr is defined on AMD platform that HIP use Radeon Open Compute Common Language Runtime (ROCclr), which is a virtual device interface that HIP runtimes interact with different backends.
59
-
See https://github.com/ROCm-Developer-Tools/ROCclr
59
+
Note, starting from ROCM 5.6 release, clr is a new repository including the previous ROCclr, HIPAMD and OpenCl repositories.
60
+
ROCclr is defined on AMD platform that HIP uses Radeon Open Compute Common Language Runtime (ROCclr), which is a virtual device interface that HIP runtimes interact with different backends.
61
+
HIPAMD provides implementation specifically for AMD platform.
62
+
OpenCL provides headers that ROCclr runtime currently depends on.
60
63
61
-
HIPAMD repository provides implementation specifically for AMD platform.
62
-
See https://github.com/ROCm-Developer-Tools/hipamd
| AMD_LOG_LEVEL <br><sub> Enable HIP log on different Level. </sub> | 0 | 0: Disable log. <br> 1: Enable log on error level. <br> 2: Enable log on warning and below levels. <br> 0x3: Enable log on information and below levels. <br> 0x4: Decode and display AQL packets. |
246
246
| AMD_LOG_MASK <br><sub> Enable HIP log on different Level. </sub> | 0x7FFFFFFF | 0x1: Log API calls. <br> 0x02: Kernel and Copy Commands and Barriers. <br> 0x4: Synchronization and waiting for commands to finish. <br> 0x8: Enable log on information and below levels. <br> 0x20: Queue commands and queue contents. <br> 0x40:Signal creation, allocation, pool. <br> 0x80: Locks and thread-safety code. <br> 0x100: Copy debug. <br> 0x200: Detailed copy debug. <br> 0x400: Resource allocation, performance-impacting events. <br> 0x800: Initialization and shutdown. <br> 0x1000: Misc debug, not yet classified. <br> 0x2000: Show raw bytes of AQL packet. <br> 0x4000: Show code creation debug. <br> 0x8000: More detailed command info, including barrier commands. <br> 0x10000: Log message location. <br> 0xFFFFFFFF: Log always even mask flag is zero. |
247
-
| HIP_VISIBLE_DEVICES <br><sub> Only devices whose index is present in the sequence are visible to HIP. </sub> || 0,1,2: Depending on the number of devices on the system. |
247
+
| HIP_VISIBLE_DEVICES(or CUDA_VISIBLE_DEVICES) <br><sub> Only devices whose index is present in the sequence are visible to HIP. </sub> || 0,1,2: Depending on the number of devices on the system. |
Copy file name to clipboardExpand all lines: docs/user_guide/faq.md
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -226,7 +226,7 @@ If you have compiled the application yourself, make sure you have given the corr
226
226
If you have a precompiled application/library (like rocblas, tensorflow etc) which gives you such error, there are one of two possibilities.
227
227
228
228
- The application/library does not ship code object bundles for *all* of your device(s): in this case you need to recompile the application/library yourself with correct `--offload-arch`.
229
-
- The application/library does not ship code object bundles for *some* of your device(s), for example you have a system with an APU + GPU and the library does not ship code objects for your APU. For this you can set the environment variable `HIP_VISIBLE_DEVICES` to only enable GPUs for which code object is available. This will limit the GPUs visible to your application and allow it to run.
229
+
- The application/library does not ship code object bundles for *some* of your device(s), for example you have a system with an APU + GPU and the library does not ship code objects for your APU. For this you can set the environment variable `HIP_VISIBLE_DEVICES`or `CUDA_VISIBLE_DEVICES` on NVdia platform, to only enable GPUs for which code object is available. This will limit the GPUs visible to your application and allow it to run.
0 commit comments