Add Runtime dispatch based on custom CPU capabilities function #607

willieyz · 2025-11-04T11:14:00Z

Resolves Port run-time dispatch from mlkem-native #609
This PR ported: Runtime dispatch based on custom CPU capabilities function mlkem-native#1152

This PR is still working in progress:

Ported:
- Runtime dispatch mechanism for AArch64, x86 native, and FIPS202 APIs.
- Configuration for static ON/OFF, AVX2 and AA64PFR1_EL1, as well as the corresponding config-variations tests in CI.
Not yet ported:
- ec2_functests are not included in CI yet, since this section does not exist in mldsa-native. We have already opened Add ec2_functests in ci.yml #608 and will port it in a separate PR before finalizing this one.
  - PR 608 merged, enable in this PR.

There is currently a constant-time leak issue due to the runtime dispatch change in mld_poly_chknorm:

Lines 589 to 604 in 05ad5ed

    
           MLD_INTERNAL_API 
        
           uint32_t mld_poly_chknorm(const mld_poly *a, int32_t B) 
        
           { 
        
             unsigned int i; 
        
             uint32_t t = 0; 
        
           #if defined(MLD_USE_NATIVE_POLY_CHKNORM) 
        
             int ret; 
        
             /* TODO: proof */ 
        
             mld_assert_bound(a->coeffs, MLDSA_N, -REDUCE32_RANGE_MAX, REDUCE32_RANGE_MAX); 
        
             ret = mld_poly_chknorm_native(a->coeffs, B); 
        
             if (ret != MLD_NATIVE_FUNC_FALLBACK) 
        
             { 
        
               return ret == 0 ? 0 : 0xFFFFFFFF; 
        
             } 
        
           #endif /* MLD_USE_NATIVE_POLY_CHKNORM */ 
        
             mld_assert_bound(a->coeffs, MLDSA_N, -REDUCE32_RANGE_MAX, REDUCE32_RANGE_MAX);

This has been discussed with @mkannwischer, and asking for help to add the fix.

- This commit adds EC2 functional tests for mldsa-native, similar to what we have in mlkem-native. - Unlike other functional tests that run on GitHub runners, the EC2 tests provide a bit more flexibility and control over the actual CPU architecture being tested. - During this work, we found that the CI did not raise an error when specifying a config_variations value that does not exist. For example, the previous commit passed several non-existent config headers from PR #607, which has not been merged yet. The CI silently accepted these missing configurations without any error. - Since the configuration header from PR #607 is not yet merged, we temporarily comment out the config_variations parameter. Once PR #607 is ported, we will re-enable it. - We also plan to add test ID validation, similar to mlkem-native PR #1291, to prevent false-positive test results caused by invalid configuration entries. Signed-off-by: willieyz <[email protected]>

- This commit adds EC2 functional tests for mldsa-native, similar to what we have in mlkem-native. - Unlike other functional tests that run on GitHub runners, the EC2 tests provide a bit more flexibility and control over the actual CPU architecture being tested. - During this work, we found that the CI did not raise an error when specifying a config_variations value that does not exist. For example, In this commit, We passed several non-existent config headers from PR #607, which has not been merged yet. The CI silently accepted these missing configurations without any error. - Since the configuration header from PR #607 is not yet merged, we temporarily comment out the config_variations parameter. Once PR #607 is ported, we will re-enable it. - We also plan to add test ID validation, similar to mlkem-native PR #1291, to prevent false-positive test results caused by invalid configuration entries. Signed-off-by: willieyz <[email protected]>

Previously the native chknorm functions would follow the C implementation, i.e., return 0 if all coefficients are within bound and 0xFFFFFFFF otherwise. This leads to problems with the run-time dispatch in #607 as we want to use -1 to signal that the current platforms lacks the required capabilities to run the native code, and we should fall back to the C implementation. This commit changes the backend API to return 1 in the failure mode. This will allow to implement the run-time dispatch.

Previously the native chknorm functions would follow the C implementation, i.e., return 0 if all coefficients are within bound and 0xFFFFFFFF otherwise. This leads to problems with the run-time dispatch in #607 as we want to use -1 to signal that the current platforms lacks the required capabilities to run the native code, and we should fall back to the C implementation. This commit changes the backend API to return 1 in the failure mode. This will allow to implement the run-time dispatch. Signed-off-by: Matthias J. Kannwischer <[email protected]>

…te_bitrev_to_custom) - Change mld_ntt_native() return type from void to int - Add runtime capability checking with fallback support - Implement dispatch logic in mld_poly_ntt() to try native first, fallback to C - Add MLD_NATIVE_FUNC_SUCCESS/FALLBACK return codes - Add mld_sys_check_capability() for system capability detection - Add test configuration for AVX2, static ON/OFF, add to CI test. Signed-off-by: willieyz <[email protected]>

…a4_native) Signed-off-by: willieyz <[email protected]>

Signed-off-by: willieyz <[email protected]>

…native) Signed-off-by: willieyz <[email protected]>

Signed-off-by: willieyz <[email protected]>

- This commit temporary add MLD_NATIVE_FUNC_SUCCESS to exception, this macro will introduced after PR #607, remove after it merged. - This commit re-run autogen Signed-off-by: willieyz <[email protected]>

willieyz changed the title ~~Runtime dispatch~~ Add Runtime dispatch based on custom CPU capabilities function Nov 4, 2025

willieyz force-pushed the runtime-dispatch branch 5 times, most recently from 6332a01 to b0ac532 Compare November 4, 2025 12:24

This was referenced Nov 4, 2025

Align native api.h with meta.h #606

Merged

Add ec2_functests in ci.yml #608

Closed

willieyz force-pushed the runtime-dispatch branch 22 times, most recently from 74996f5 to 3e7a817 Compare November 6, 2025 11:47

mkannwischer mentioned this pull request Nov 12, 2025

native chknorm: 0/0xFFFFFFFF -> 0/1 #677

Merged

willieyz added 10 commits November 13, 2025 16:58

Add runtime dispatch (mld_rej_uniform_native, mld_rej_uniform_eta2/et…

b51f45a

…a4_native) Signed-off-by: willieyz <[email protected]>

Add runtime dispatch (mld_poly_decompose_32/88_native)

e2db90b

Signed-off-by: willieyz <[email protected]>

Add runtime dispatch (mld_poly_caddq_native)

5b860f8

Signed-off-by: willieyz <[email protected]>

Add runtime dispatch (mld_poly_use_hint_32/88_native)

93b7f1a

Signed-off-by: willieyz <[email protected]>

Add runtime dispatch (mld_poly_chknorm)

ecf6816

Signed-off-by: willieyz <[email protected]>

Add runtime dispatch (mld_polyz_unpack_17/19_native)

32dfef3

Signed-off-by: willieyz <[email protected]>

Add runtime dispatch (mld_poly_pointwise_montgomery_native)

234878d

Signed-off-by: willieyz <[email protected]>

Add runtime dispatch (mld_polyvecl_pointwise_acc_montgomery_l4/l5/l7_…

65c0e23

…native) Signed-off-by: willieyz <[email protected]>

Add runtime dispatch (FIPS202)

dc8208e

Signed-off-by: willieyz <[email protected]>

willieyz force-pushed the runtime-dispatch branch from b1a6654 to dc8208e Compare November 13, 2025 09:00

ec2_functests: Remove TODO and enable the config-variations test

2fb636e

Signed-off-by: willieyz <[email protected]>

willieyz mentioned this pull request Nov 13, 2025

Example: Add custom_backend #699

Open

mkannwischer mentioned this pull request Nov 17, 2025

Possible mis-matched return types in AArch64 back-end #703

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Runtime dispatch based on custom CPU capabilities function #607

Add Runtime dispatch based on custom CPU capabilities function #607

Uh oh!

willieyz commented Nov 4, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	MLD_INTERNAL_API
	uint32_t mld_poly_chknorm(const mld_poly *a, int32_t B)
	{
	unsigned int i;
	uint32_t t = 0;
	#if defined(MLD_USE_NATIVE_POLY_CHKNORM)
	int ret;
	/* TODO: proof */
	mld_assert_bound(a->coeffs, MLDSA_N, -REDUCE32_RANGE_MAX, REDUCE32_RANGE_MAX);
	ret = mld_poly_chknorm_native(a->coeffs, B);
	if (ret != MLD_NATIVE_FUNC_FALLBACK)
	{
	return ret == 0 ? 0 : 0xFFFFFFFF;
	}
	#endif /* MLD_USE_NATIVE_POLY_CHKNORM */
	mld_assert_bound(a->coeffs, MLDSA_N, -REDUCE32_RANGE_MAX, REDUCE32_RANGE_MAX);

Add Runtime dispatch based on custom CPU capabilities function #607

Are you sure you want to change the base?

Add Runtime dispatch based on custom CPU capabilities function #607

Uh oh!

Conversation

willieyz commented Nov 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

willieyz commented Nov 4, 2025 •

edited

Loading