Closed
Building OpenBLAS with TARGET=RISCV64_ZVL256B can result in significant performance problems, as shown in the two tables below. The tables report performance with RISCV64_GENERIC and with RISCV64_ZVL256B, together with the corresponding speedup ratios. HBMV is a Level 2 function, and I would like to move its computation into a kernel so that I can make RISC-V-specific modifications. However, I do not know exactly how to do this. Do you have any suggestions?
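For reference, the computation in question can be sketched as below. This is a minimal, plain C illustration and not OpenBLAS code: it assumes upper band storage in column-major order, omits the beta scaling of y, and assumes the Level 2 routine is organized as a loop over columns that calls Level 1 style helpers. The names `axpy_kernel`, `dotc_kernel`, and `hbmv_upper` are hypothetical and exist only for this example.

```c
#include <assert.h>
#include <complex.h>
#include <stddef.h>

/* Hypothetical stand-in for a Level 1 AXPY kernel: y[i] += a * x[i]. */
static void axpy_kernel(size_t n, float complex a,
                        const float complex *x, float complex *y)
{
    for (size_t i = 0; i < n; i++)
        y[i] += a * x[i];
}

/* Hypothetical stand-in for a conjugated dot kernel: sum of conj(x[i]) * y[i]. */
static float complex dotc_kernel(size_t n, const float complex *x,
                                 const float complex *y)
{
    float complex acc = 0.0f;
    for (size_t i = 0; i < n; i++)
        acc += conjf(x[i]) * y[i];
    return acc;
}

/*
 * y += alpha * A * x for an n x n Hermitian band matrix A with k
 * superdiagonals. Assumed layout: upper band storage, column-major,
 * with A(i,j) at ab[(k + i - j) + j * lda] for max(0, j-k) <= i <= j.
 * The beta * y scaling of full HBMV is left out to keep the sketch short.
 */
static void hbmv_upper(size_t n, size_t k, float complex alpha,
                       const float complex *ab, size_t lda,
                       const float complex *x, float complex *y)
{
    assert(lda >= k + 1);
    for (size_t j = 0; j < n; j++) {
        const float complex *col = ab + j * lda;
        size_t len = j < k ? j : k;   /* stored entries above the diagonal */
        size_t top = j - len;         /* row index of the first stored entry */

        /* Stored upper part of column j contributes to rows top..j-1. */
        axpy_kernel(len, alpha * x[j], col + (k - len), y + top);

        /* The diagonal of a Hermitian matrix is real. */
        y[j] += alpha * crealf(col[k]) * x[j];

        /* The same entries, conjugated, form row j of the lower part. */
        y[j] += alpha * dotc_kernel(len, col + (k - len), x + top);
    }
}
```

In a structure like this, the two inner helpers are where vectorized code would matter most, so one possible direction is either to tune those Level 1 kernels for the target or to replace the whole column loop with a single dedicated kernel.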

