Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
Expand Up @@ -142,6 +142,10 @@ This section is non-normative and applies only to the {dpcpp} implementation.
The table below explains how {dpcpp} calculates the number of compute units for
different combinations of device and backend.

:intel-link: https://www.intel.com/content/www/us/en/docs/oneapi/optimization-guide-gpu/2025-2/intel-xe-gpu-architecture.html#INTEL-R-XE-GPU-BUILDING-BLOCKS
:nvidia-link: https://docs.nvidia.com/deeplearning/performance/dl-performance-gpu-background/index.html#gpu-arch
:amd-link: https://rocm.docs.amd.com/projects/HIP/en/latest/understand/hardware_implementation.html#hierarchical-organization

[%header,cols="1,5,10"]
|===
|Device Type
Expand All @@ -150,19 +154,21 @@ different combinations of device and backend.

|CPU
|OpenCL
|Number of logical cores.
|Number of logical cores

|Intel GPU
|Any
|Number of Xe cores.
|Number of Xe cores (link:{intel-link}[link])

|NVIDIA GPU
|Any
|Number of streaming multiprocessors (SMs).
|Number of streaming multiprocessors (SMs) (link:{nvidia-link}[link])

|AMD GPU
|Any
|Number of compute units (CUs) (link:{amd-link}[link])
|===


== Issues

None.