Skip to content

Commit e433066

Browse files
author
JAppleyard
committed
Adding readme to LSTM code
1 parent 34391cf commit e433066

File tree

1 file changed

+5
-0
lines changed

1 file changed

+5
-0
lines changed

posts/rnn/README.md

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,5 @@
1+
## Optimized LSTM forward pass in CUDA
2+
3+
This code implements an LSTM forward pass in CUDA using cuBLAS. It uses cuRand to initialize the inputs and parameters.
4+
5+
The PERFOPTS define allows the user to select which optimizations are enabled. It is a bitmask taking values from 0 to 31 with each bit defining whether a particular optimization is enabled.

0 commit comments

Comments
 (0)