How to parallelize `matmul` beyond OpenMP directives in MKL?
Activity