AI_kernels

Machine learning and AI kernels implemented in various languages and frameworks.

General information and links for each framework are located in the vector_add subdirectories.

  • vector_add - The simplest kernel: element-wise addition of two vectors
  • dot - Dot product of two vectors
  • gemv - Matrix-vector multiplication
  • gemm - Matrix-matrix multiplication
  • latency - Microbenchmark to measure memory latency
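
As a rough orientation, the four arithmetic kernels listed above can be sketched as plain-Python reference versions. This is a minimal illustration of what each kernel computes, not code from this repository: the function names simply mirror the directory names, and the list-of-lists matrix layout is an assumption made for readability. The latency microbenchmark is omitted because it measures hardware behavior rather than computing a result.

```python
# Hypothetical reference sketches; the repository's own kernels are written
# for specific languages and frameworks and will look different.

def vector_add(x, y):
    """Element-wise sum of two equal-length vectors."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    return [a + b for a, b in zip(x, y)]


def dot(x, y):
    """Dot product: the sum of element-wise products."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    return sum(a * b for a, b in zip(x, y))


def gemv(A, x):
    """Matrix-vector product: one dot product per row of A."""
    return [dot(row, x) for row in A]


def gemm(A, B):
    """Matrix-matrix product: a dot product of each row of A with each column of B."""
    cols = list(zip(*B))  # transpose B so its columns can be iterated directly
    return [[dot(row, col) for col in cols] for row in A]


if __name__ == "__main__":
    print(vector_add([1, 2, 3], [4, 5, 6]))
    print(dot([1, 2, 3], [4, 5, 6]))
    print(gemv([[1, 2], [3, 4]], [5, 6]))
    print(gemm([[1, 2], [3, 4]], [[5, 6], [7, 8]]))
```

Each kernel builds on the previous one (gemv is a dot product per row, gemm is a dot product per row-column pair), which is one way to read the ordering of the list above.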

The info directory contains links to additional technical information on GPU topics.