Simple Matrix Multiplication in CUDA. Fills the Arrays and Multiplies them as a linear arrangement. As evident from the screenshots, CUDA is really fast. Matrices were 1024*1024 but I only printed 5 and 10 in each case. Nvprof was unsupported so I resorted to simple timing events/chrono to record the execution time.
-
Notifications
You must be signed in to change notification settings - Fork 0
paradoxical95/Matrix_Multiplication_CUDA
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
About
Simple Matrix Multiplication in CUDA on a 1024*1024 grid
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published

