Introduce `BlockSize` #3716

schnellerhase · 2025-04-27T15:32:22Z

In performance critical parts some block sizes are optimized for by compiling explicit versions with the block size being provided as a compile time constant. At the same time general runtime block sizes are supported through an argument to these functions.

This causes

Code duplication: one path for the runtime and one for the compile time definitions of the block sizes, and
duplicate input of the block sizes: once as template argument once as argument (matching of both is only asserted does not raise in release due to performance impact)

Introduces a BlockSize concept that either holds a runtime int or a compile time std::integral_constant<int, bs> which allows to generate code paths explicitly for certain sizes, while maintaining a shared code path in both cases.

form packing optimizes for block sizes 1,2,3 - vector assembly for 1,3: is this miss match intentional?
matrix operation routines

jhale · 2025-04-27T18:36:15Z

Looks very nice. Could we review the basic approach before you spend lots more time on it?

schnellerhase · 2025-04-27T18:49:26Z

Sure thing. Should be good to go as is and can be extended further when approved. One neat byproduct, that these changes would allow for, are non compile time sized operations on the MatrixCSR which we are currently missing.

chrisrichardson

Looking good.

garth-wells · 2025-04-28T17:16:16Z

Looks really neat.

Should the name be more generic, it's basically a runtime or templated integer. I can think of applications outside of block size, e.g. geometric dimension, where it could be useful.
Should it support different integer types?
Could tests be added to check that when it's a compile time integer that it really is a compiler time integer?

schnellerhase · 2025-04-28T17:56:29Z

For points 1 and 2 that should be no problem - how about: ConstexprType as name for the general concept?

Regarding 3: the interface to retrieve the value (here block_size) needs to be able to produce both a runtime value and a compile time value. Therefore it can not be marked constexpr. Testing for in lining of the compile time variant is also not straight forward as this remains in all cases a compiler decision. Best way to check for its effect, I assume, would be with a benchmark of those cases.

garth-wells · 2025-04-30T08:03:01Z

Regarding 3: the interface to retrieve the value (here block_size) needs to be able to produce both a runtime value and a compile time value. Therefore it can not be marked constexpr. Testing for in lining of the compile time variant is also not straight forward as this remains in all cases a compiler decision. Best way to check for its effect, I assume, would be with a benchmark of those cases.

I don't like relying on the compiler to inline things that we know are known at compile time. We have avoided this in the past and preferred being explicit over relying on the compiler and then not knowing what the compiler does.

schnellerhase · 2025-04-30T08:14:12Z

It would be best if the block_size/value function would be constexpr for the compile time case. I will try if I can recover that behaviour.

schnellerhase · 2025-04-30T15:30:53Z

It think I have a fix: value(ConxtexprType<T, V>) is now constexpr for is_compile_v<T, V> == True and otherwise not. The test case showcases that we can assert during compile time now. ~~(Block size is not yet adapted)~~.

This reverts commit 2652fb5.

schnellerhase added 6 commits April 27, 2025 17:21

Introduce BlockSize concept

5f80934

Use BlockSize in packing

65ff61f

Use BlockSize in vector assembly

a708e7d

Adapt demo

5b65ad8

Introduce BS<> alias

29a1219

Use BlockSize in spmv

10cc79c

schnellerhase force-pushed the block_size branch from bd90307 to 10cc79c Compare April 27, 2025 18:19

doc

1fb65d4

schnellerhase marked this pull request as ready for review April 27, 2025 18:50

chrisrichardson self-requested a review April 28, 2025 15:26

chrisrichardson approved these changes Apr 28, 2025

View reviewed changes

schnellerhase added 4 commits April 30, 2025 01:38

Introduce generic ConstexprType

dcfef33

value()

b8b0f90

Add test case

152e8d0

format

0e2ad15

schnellerhase added 2 commits April 30, 2025 17:22

constexpr value access

31a146d

format

6a4d5b5

schnellerhase added 6 commits April 30, 2025 22:40

Bump PETSc/SLEPc

2652fb5

Revert "Bump PETSc/SLEPc"

c762822

This reverts commit 2652fb5.

Tidy up

796725c

Merge branch 'main' into block_size

7602862

Compiler limitation for floating point values

460b350

Misses year code

5c1d722

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Introduce `BlockSize` #3716

Introduce `BlockSize` #3716

Uh oh!

schnellerhase commented Apr 27, 2025 •

edited

Loading

Uh oh!

jhale commented Apr 27, 2025

Uh oh!

schnellerhase commented Apr 27, 2025

Uh oh!

chrisrichardson left a comment

Uh oh!

garth-wells commented Apr 28, 2025

Uh oh!

schnellerhase commented Apr 28, 2025

Uh oh!

garth-wells commented Apr 30, 2025

Uh oh!

schnellerhase commented Apr 30, 2025

Uh oh!

schnellerhase commented Apr 30, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Introduce BlockSize #3716

Are you sure you want to change the base?

Introduce BlockSize #3716

Uh oh!

Conversation

schnellerhase commented Apr 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jhale commented Apr 27, 2025

Uh oh!

schnellerhase commented Apr 27, 2025

Uh oh!

chrisrichardson left a comment

Choose a reason for hiding this comment

Uh oh!

garth-wells commented Apr 28, 2025

Uh oh!

schnellerhase commented Apr 28, 2025

Uh oh!

garth-wells commented Apr 30, 2025

Uh oh!

schnellerhase commented Apr 30, 2025

Uh oh!

schnellerhase commented Apr 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Introduce `BlockSize` #3716

Introduce `BlockSize` #3716

schnellerhase commented Apr 27, 2025 •

edited

Loading

schnellerhase commented Apr 30, 2025 •

edited

Loading