Device Agnostic backend API & CUDA support #214

tom-pollak · 2025-10-09T11:04:47Z

Motivation

Refactor to provide a device-agnostic backend API, with optional device-specific functionality.

Changes

Backend Detection (iris/backend/__init__.py)

Auto-detects the available GPU runtime at import time.
Provides a unified API across devices; iris.hip is refactored into iris.backend.
The backend can be forced via the IRIS_BACKEND environment variable.
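
As a rough illustration of the detection order described above (environment override first, then probing for a runtime), here is a minimal sketch. It is not the code in this PR: the function name detect_backend, the accepted values "hip" and "cuda", and the probed library names are assumptions made for the example.

```python
import os
from ctypes.util import find_library

# Assumed mapping from backend name to a runtime library whose presence
# suggests that backend is usable. Not taken from the PR.
_RUNTIME_LIBS = {"hip": "amdhip64", "cuda": "cudart"}


def detect_backend(environ=None, has_library=None):
    """Return "hip" or "cuda": IRIS_BACKEND wins, otherwise probe for a runtime."""
    environ = os.environ if environ is None else environ
    if has_library is None:
        # Default probe: ask the system linker whether the library is findable.
        has_library = lambda name: find_library(name) is not None

    forced = environ.get("IRIS_BACKEND", "").strip().lower()
    if forced:
        if forced not in _RUNTIME_LIBS:
            raise ValueError(f"unknown IRIS_BACKEND {forced!r}")
        return forced

    # No override: take the first backend whose runtime library is present.
    for name, lib in _RUNTIME_LIBS.items():
        if has_library(lib):
            return name
    raise RuntimeError("no supported GPU runtime found")
```

Passing environ and has_library as parameters keeps the sketch testable on a machine with no GPU runtime installed.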

API Usage

# Portable code - works on both NVIDIA (CUDA) and AMD (HIP)
import iris.backend as backend
backend.set_device(0)
size = 1024  # allocation size in bytes (illustrative value)
ptr = backend.malloc(size)

# Platform-specific features
from iris.backend import hip
ptr = hip.malloc_fine_grained(size)  # AMD: cache-coherent memory

from iris.backend import cuda
ptr = cuda.malloc_managed(size)  # NVIDIA: unified memory

Migration

All iris.hip.* calls are updated to iris.backend.*.
Reverts the previous CUDA implementation by Copilot from "Add CUDA backend support for NVIDIA GPUs with automatic detection" (#200).
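
For downstream code that still calls iris.hip.*, the migration could be mechanised along these lines. This is a hedged sketch, not tooling shipped with the PR: migrate_source is a hypothetical helper, and it assumes the old functions were reached as attributes of iris.hip. The rename table comes from the commit list (hip_malloc -> malloc, hip_free -> free, get_rocm_version -> get_runtime_version).

```python
import re

# Renames listed in this PR's commits; every other name keeps its spelling.
RENAMES = {
    "hip_malloc": "malloc",
    "hip_free": "free",
    "get_rocm_version": "get_runtime_version",
}

# Matches attribute access such as iris.hip.hip_malloc.
_CALL = re.compile(r"\biris\.hip\.(\w+)")


def migrate_source(text: str) -> str:
    """Rewrite iris.hip.<name> references to iris.backend.<new name>."""
    def _swap(match):
        name = match.group(1)
        return "iris.backend." + RENAMES.get(name, name)

    return _CALL.sub(_swap, text)
```

The pattern only rewrites dotted attribute access. Import statements such as `import iris.hip as hip` are left untouched and would need a manual edit.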

Device Interface

Note that malloc_fine_grained is not part of the shared interface. get_num_xcc should probably also be HIP-specific, but it is used in the examples, so for now the CUDA backend returns 1.

def set_device(gpu_id: int) -> None
def get_device_id() -> int
def count_devices() -> int
def get_cu_count(device_id: int | None = None) -> int
def get_wall_clock_rate(device_id: int) -> int
def get_arch_string(device_id: int | None = None) -> str
def get_num_xcc(device_id: int | None = None) -> int
def get_runtime_version() -> tuple[int, int]  # (major, minor)

def get_ipc_handle(ptr: int | ctypes.c_void_p, rank: int) -> Any
def open_ipc_handle(ipc_handle_data: np.ndarray, rank: int) -> int

def malloc(size: int) -> ctypes.c_void_p
def free(ptr: int | ctypes.c_void_p) -> None
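
To show how portable code might depend only on this interface, the sketch below types the malloc/free pair as a Protocol and wraps it in a context manager. DeviceBackend, device_buffer and FakeBackend are hypothetical names introduced for this example and are not part of the PR. FakeBackend is a host-only stand-in so the snippet runs without a GPU.

```python
import ctypes
from contextlib import contextmanager
from typing import Iterator, Protocol, Union

Pointer = Union[int, ctypes.c_void_p]


class DeviceBackend(Protocol):
    """The allocation subset of the interface listed above."""

    def malloc(self, size: int) -> ctypes.c_void_p: ...
    def free(self, ptr: Pointer) -> None: ...


@contextmanager
def device_buffer(backend: DeviceBackend, size: int) -> Iterator[ctypes.c_void_p]:
    """Allocate size bytes and free them on exit, even if the body raises."""
    ptr = backend.malloc(size)
    try:
        yield ptr
    finally:
        backend.free(ptr)


class FakeBackend:
    """Host-only stand-in that hands out fake addresses and tracks live ones."""

    def __init__(self) -> None:
        self.live = set()
        self._next = 0x1000

    def malloc(self, size: int) -> ctypes.c_void_p:
        addr = self._next
        self._next += size
        self.live.add(addr)
        return ctypes.c_void_p(addr)

    def free(self, ptr: Pointer) -> None:
        addr = ptr.value if isinstance(ptr, ctypes.c_void_p) else ptr
        self.live.discard(addr)
```

With a real backend, the call site would presumably look like `with device_buffer(iris.backend, size) as ptr:`, since a module exposing malloc and free satisfies the same structural type.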

Any comments and feedback would be welcome!

tom-pollak added 7 commits October 9, 2025 11:07

Revert "Add CUDA backend support for NVIDIA GPUs with automatic detection (ROCm#200)"

c5bb0d9

This reverts commit 6ba9c79.

added cuda bindings

854d824

auto load backend

38c76ed

porting

045d327

more compat api

141b741

hip_malloc -> malloc, hip_free -> free, get_rocm_version -> get_runtime_version

add readme

9d655fc

backend specific functionality

7f51167

tom-pollak requested review from BKP, mawad-amd and neoblizz as code owners October 9, 2025 11:04
