Skip to content

Commit ae94db4

Browse files
authored
[SYCL][CUDA][DOC] State min. CUDA ver. for sm_75 (#5642)
This patch adds a note on the Get Started Guide regarding the minimum CUDA toolkit version required for fully utilize Turing devices (sm_75). CUDA toolkit version 11.0 introduces PTX7.0. This version supports for the first time the Ampere architecture (sm_80), however some instructions introduced by PTX7.0 (e.g. approximated tanh (#5265) and ex2 for halfs) can be executed also by Turing devices (sm_75), if CUDA 11.0 (or above) is installed. Compilation on Turing devices is possible also using CUDA 10.2 (the actual version reported as tested), however if one these PTX7.0 instruction is used, it will generate an error.
1 parent 5881938 commit ae94db4

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

sycl/doc/GetStartedGuide.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -177,6 +177,9 @@ the system, refer to
177177
[NVIDIA CUDA Installation Guide for Linux](https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html)
178178
or
179179
[NVIDIA CUDA Installation Guide for Windows](https://docs.nvidia.com/cuda/cuda-installation-guide-microsoft-windows/index.html)
180+
An installation of at least
181+
[CUDA 11.0](https://developer.nvidia.com/cuda-11.0-download-archive)
182+
is required for fully utilize Turing (SM 75) devices.
180183

181184
Currently, the only combination tested is Ubuntu 18.04 with CUDA 10.2 using
182185
a Titan RTX GPU (SM 71). The CUDA backend should work on Windows or Linux

0 commit comments

Comments
 (0)