Various fixes for dynamic kernel loading #3507

1tnguyen · 2025-10-10T03:51:20Z

Description

(1) Allow override in registerDeviceKernel

(2) Clear kernel decorator's module (parsed from walking the Python AST) as we switch target.
The lack of this module attribute tells the decorator to perform a proper compile().

Note: Other global maps might be populated during kernel constructor (i.e., as Python sees kernels); hence might not be safe to clear.

(3) Python to honor the default simulator environment variable for hardware target emulation. It was hardcoded to qpp for those targets that don't define a simulator.

Signed-off-by: Thien Nguyen <[email protected]>

python/utils/LinkedLibraryHolder.cpp

github-actions · 2025-10-10T07:07:20Z

CUDA Quantum Docs Bot: A preview of the documentation can be found here.

Signed-off-by: Thien Nguyen <[email protected]>

github-actions · 2025-10-15T00:02:07Z

CUDA Quantum Docs Bot: A preview of the documentation can be found here.

github-actions · 2025-10-24T04:20:36Z

CUDA Quantum Docs Bot: A preview of the documentation can be found here.

Signed-off-by: Thien Nguyen <[email protected]>

github-actions · 2025-10-24T08:14:50Z

CUDA Quantum Docs Bot: A preview of the documentation can be found here.

python/cudaq/__init__.py

sacpis

LGTM. Thanks @1tnguyen.

schweitzpgi · 2025-10-24T14:42:17Z

I'm going to very likely revert this with my changes.

Yes. I'll revert these changes in my total rewrite of the python handling of builders, decorators, compilation, and launching.

Signed-off-by: Eric Schweitz <[email protected]>

The current implementation of the Python handling of CUDA-Q has baked in various attempts to deal with the language coupling between Python and CUDA-Q kernels. These solutions have been accumulating and making it more and more difficult to work on the Python implementation. These changes are a total rewrite to bring the Python implementation more closely aligned with the C++ implementation. Changes: - The kernel builder and kernel decorator are fundamentally different and will no longer share a duck-typed interface. It doesn't work well. The builder assembles a CUDA-Q kernel dynamically. As such all symbolic references are known immediately. The decorator converts a static AST of code into a CUDA-Q kernel. Symbolic references are either local or not. Non-local symbols are unknown at the point the decorator is processed. All non-local symbols in a decorator are recorded with the decorator itself and lambda lifted as actual arguments. - MLIR requires that symbols be uniqued. The previous implementation ignored this requirement. - Lazy state maintenance in Python and the C++ runtime layers is buggy and not needed. It is removed. This includes dangling MLIR bindings from the AST bridge's python MLIR bindings. - Kernels are no longer built with assumptions, then rebuilt when those guesses prove wrong. Kernels are no longer built and rebuilt for different steps in the process. A kernel decorator builds a target agnostic, context independent kernel, and saves that MLIR ModuleOp under a unique name. - Launch scenarios have been reworked and consolidated to use the ModuleOp directly instead of shuffling between string representations (possibly under maps that were not thread-safe) and ModuleOp instances. - Every step of the process creating a brand new MLIRContext and loading all the dialects into that context, etc. is removed. This is done once and the Python interpreter uses the same context to build all modules. These changes also revert some work on the bridge meant to fix bugs that was in conflict. This includes NVIDIA#3507, NVIDIA#3545. Signed-off-by: Eric Schweitz <[email protected]>

1tnguyen added 4 commits October 9, 2025 06:17

Override code in registerDeviceKernel

94eb312

Signed-off-by: Thien Nguyen <[email protected]>

Clear cache

c821bd9

Signed-off-by: Thien Nguyen <[email protected]>

Only clear kernel module

4d6881e

Signed-off-by: Thien Nguyen <[email protected]>

Python to take into account default sim for emulation

5ab959f

Signed-off-by: Thien Nguyen <[email protected]>

sacpis reviewed Oct 10, 2025

View reviewed changes

python/utils/LinkedLibraryHolder.cpp Show resolved Hide resolved

sacpis reviewed Oct 10, 2025

View reviewed changes

python/utils/LinkedLibraryHolder.cpp Outdated Show resolved Hide resolved

sacpis reviewed Oct 10, 2025

View reviewed changes

python/utils/LinkedLibraryHolder.cpp Show resolved Hide resolved

github-actions bot pushed a commit that referenced this pull request Oct 10, 2025

Docs preview for PR #3507.

5099cc9

1tnguyen and others added 2 commits October 15, 2025 09:11

Merge branch 'main' into tnguyen/device-kernel-registry

3d32d2a

Address code review comments and format code

9d92280

Signed-off-by: Thien Nguyen <[email protected]>

github-actions bot pushed a commit that referenced this pull request Oct 15, 2025

Docs preview for PR #3507.

d8f12b9

1tnguyen marked this pull request as ready for review October 24, 2025 02:35

Merge branch 'main' into tnguyen/device-kernel-registry

3ede0e8

github-actions bot pushed a commit that referenced this pull request Oct 24, 2025

Docs preview for PR #3507.

4f00c0a

1tnguyen and others added 2 commits October 24, 2025 05:09

Fix an issue with the special flag

5ca4e94

Signed-off-by: Thien Nguyen <[email protected]>

Merge branch 'main' into tnguyen/device-kernel-registry

13abb4e

github-actions bot pushed a commit that referenced this pull request Oct 24, 2025

Docs preview for PR #3507.

f10103b

sacpis reviewed Oct 24, 2025

View reviewed changes

python/cudaq/__init__.py Show resolved Hide resolved

sacpis approved these changes Oct 24, 2025

View reviewed changes

1tnguyen merged commit 67fd323 into NVIDIA:main Oct 24, 2025
193 checks passed

github-actions bot pushed a commit that referenced this pull request Oct 24, 2025

Cleaning up docs preview for PR #3507.

b2ee116

1tnguyen mentioned this pull request Oct 26, 2025

[Python] Use cudaq.complex() rather than fixed complex type #3549

Merged

schweitzpgi added a commit to schweitzpgi/cuda-quantum that referenced this pull request Oct 31, 2025

Revert NVIDIA#3507. Superceded by doing it the correct way.

f8f2506

Signed-off-by: Eric Schweitz <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Various fixes for dynamic kernel loading #3507

Various fixes for dynamic kernel loading #3507

Uh oh!

1tnguyen commented Oct 10, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Oct 10, 2025

Uh oh!

github-actions bot commented Oct 15, 2025

Uh oh!

github-actions bot commented Oct 24, 2025

Uh oh!

github-actions bot commented Oct 24, 2025

Uh oh!

Uh oh!

sacpis left a comment

Uh oh!

Uh oh!

schweitzpgi commented Oct 24, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Various fixes for dynamic kernel loading #3507

Various fixes for dynamic kernel loading #3507

Uh oh!

Conversation

1tnguyen commented Oct 10, 2025

Description

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Oct 10, 2025

Uh oh!

github-actions bot commented Oct 15, 2025

Uh oh!

github-actions bot commented Oct 24, 2025

Uh oh!

github-actions bot commented Oct 24, 2025

Uh oh!

Uh oh!

sacpis left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

schweitzpgi commented Oct 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

schweitzpgi commented Oct 24, 2025 •

edited

Loading