-
Notifications
You must be signed in to change notification settings - Fork 58
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
runtime dispatching with MIPP library, papi for performance measurement
* pf_conv_* libraries for naive convolution with architecture specific optimizations - utilizing MIPP library for SIMD and benchmark utilitzing papi for measurement * MIPP with [un]install patch from https://github.com/hayguen/MIPP should be installed * bench_mixers also uses papi (if available) * fixed address sanitizer (asan) * removed cmake options DISABLE_SIMD_AVX, USE_SIMD_NEON * target_optimizations.cmake: - added optional additional(extra) options to target_set_c[xx]_arch_flags() - added gcc_clang_fpu options to target_set_cxx_arch_option() macro * with 'neon' variants for dispatching on cmake processor 'armv7l' papi doesn't work on this 32 bit Raspbian on the Raspberry Pi 4B hardware, but linux 'time' shows a total speedup of nearly factor 2 comparing '-a 0' (none) agains '-a <n>' (neon) Signed-off-by: hayati ayguen <[email protected]>
- Loading branch information
Showing
16 changed files
with
1,734 additions
and
228 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.