[tmva][sofie] Improve AD-friendliness of emitted code for Clad #21896
Merged
guitargeek merged 1 commit into root-project:master (Apr 14, 2026)
Conversation
Test Results: 22 files, 22 suites, 3d 7h 19m 23s ⏱️. For more details on these failures, see this check. Results for commit c558240. ♻️ This comment has been updated with latest results.
lmoneta (Member) approved these changes — Apr 14, 2026

LGTM!
Thank you, Jonas, for these improvements, which help Clad. I have only one question: is it better to implement a Copy function, or to use std::copy directly?
This commit refactors SOFIE-generated inference code to enable correct and efficient reverse-mode automatic differentiation with Clad.

Key changes:

* Introduce explicit primitive operations (`Copy`, `Fill`, `Relu`) in SOFIE_common.hxx and provide corresponding custom pullbacks in CladDerivator.h. This replaces previously inlined loops and allows Clad to generate efficient gradient code without relying on tapes or loop-level differentiation.
* Update Gemm code generation to emit Copy/Fill instead of manually expanding bias-initialization loops. This better exposes the intent and improves AD performance and correctness.
* Replace manual ReLU loops with a dedicated Relu() call, enabling a custom pullback that avoids tape-based condition tracking.
* Generate an additional "unoptimized" model variant in the SOFIE test suite (`OptimizationLevel::kBasic`) and use it for AD tests. This disables memory reuse of intermediate tensors: opaque memory reuse is safe for inference but breaks source-transformation AD.
* Improve gradient test diagnostics in SOFIE Clad tests by reporting mismatched indices instead of only checking a global max difference.

With these changes, Clad-generated gradients for SOFIE models are both correct and significantly faster, reaching performance comparable to frameworks such as PyTorch and JAX on the CPU for the tested cases (fully-connected neural networks with multiple layers).
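As an illustration of the tape-free ReLU pullback described above (a hedged sketch with illustrative names; the real declarations live in SOFIE_common.hxx and CladDerivator.h and may differ):

```cpp
#include <cstddef>

// Hypothetical element-wise ReLU primitive: out[i] = max(in[i], 0).
void Relu(const float *in, float *out, std::size_t n) {
   for (std::size_t i = 0; i < n; ++i)
      out[i] = in[i] > 0.f ? in[i] : 0.f;
}

// Sketch of the custom pullback. The branch condition in[i] > 0 is
// recomputed from the saved forward input, so no per-element tape entry
// is needed to remember which branch each element took.
void Relu_pullback(const float *in, std::size_t n,
                   float *d_in, const float *d_out) {
   for (std::size_t i = 0; i < n; ++i)
      d_in[i] += in[i] > 0.f ? d_out[i] : 0.f;
}
```

Recomputing the condition from forward inputs, instead of replaying a tape of recorded branch decisions, is what lets the generated gradient code stay a plain loop over contiguous buffers.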
Force-pushed from c558240 to b7655c7 (Compare)