Fix incorrect cast from BF16 to FP32 in SBGEMM by murste01 · Pull Request #5712 · OpenMathLib/OpenBLAS

murste01 · 2026-03-26T13:25:37Z

This change fixes a regression in SBGEMM in which C was assumed to be BF16. The code unconditionally attempted to cast elements of C from BF16 to FP32, resulting in incorrect outputs when beta=1.
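To illustrate the class of bug described above, here is a minimal sketch in C. It is not the OpenBLAS kernel code, which is not shown in this thread; the helper names `bf16_to_fp32`, `update_c_fixed`, and `update_c_buggy` are hypothetical. It assumes C is stored as FP32 and that the update has the usual GEMM form `C = beta*C + acc`. BF16 is treated as the upper 16 bits of an IEEE-754 binary32 value.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Widen a BF16 bit pattern to FP32 by placing it in the upper 16 bits. */
static float bf16_to_fp32(uint16_t h) {
    uint32_t bits = (uint32_t)h << 16;
    float f;
    memcpy(&f, &bits, sizeof f);
    return f;
}

/* Correct path (assumption: C holds FP32 values): read C[i] directly. */
static float update_c_fixed(const float *c, size_t i, float beta, float acc) {
    return beta * c[i] + acc;
}

/* Hypothetical buggy path: reinterpret the FP32 buffer as BF16 storage
 * and widen element i, which reads the wrong 16 bits of the data. */
static float update_c_buggy(const float *c, size_t i, float beta, float acc) {
    uint16_t h;
    memcpy(&h, (const unsigned char *)c + i * sizeof h, sizeof h);
    return beta * bf16_to_fp32(h) + acc;
}
```

With `beta = 1` the existing contents of C feed straight into the result, so any misreading of C shows up directly in the output. For example, with `c[0] = 1.1f` and `acc = 0.0f`, the fixed path returns `1.1f`, while the buggy path returns a different value that depends on byte order.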

aditew01 · 2026-03-27T09:43:20Z

@martin-frbg is there a way we can get a patch release for OpenBLAS?
The current bug in 0.3.32 blocks OpenBLAS upgrade in PyTorch, because it causes a unit test failure.

martin-frbg · 2026-03-27T11:14:18Z

Yes, I'll try to do 0.3.33 this weekend - though the spurious utest failure bothers me a lot less than the weird DDOT bug that my workaround for the weird Neoverse SDOT bug introduced. Coincidentally the Reference-LAPACK team is committing a flurry of fixes for their upcoming 3.13, so it looks like a good chance to import those as well.

aditew01 · 2026-03-27T11:26:23Z

that'll be much appreciated! 🙏

ChipKerchner · 2026-03-29T19:17:40Z

I think this is what I've been pointing out for a while now.

aditew01 · 2026-03-31T09:22:18Z

> Yes, I'll try to do 0.3.33 this weekend - though the spurious utest failure bothers me a lot less than the weird DDOT bug that my workaround for the weird Neoverse SDOT bug introduced. Coincidentally the Reference-LAPACK team is committing a flurry of fixes for their upcoming 3.13, so it looks like a good chance to import those as well.

@martin-frbg gentle nudge on this. Do you think we've a release candidate for 0.3.33 or are we waiting for more patches / fixes?
Apologies for the constant nag 🙏

Fix incorrect cast from BF16 to FP32 in SBGEMM

f6d4fe7

This change fixes a regression in SBGEMM where C is assumed to be BF16, and so unconditionally casts the output to FP32, resulting in incorrect outputs when beta=1.

martin-frbg added this to the 0.3.33 milestone Mar 26, 2026

martin-frbg merged commit 3c188e4 into OpenMathLib:develop Mar 27, 2026
99 of 102 checks passed

puneetmatharu mentioned this pull request Mar 27, 2026

Bump sources and handle latest patches ARM-software/Tool-Solutions#451

Open

martin-frbg merged 1 commit into OpenMathLib:develop from murste01:develop
