Releases: lucidrains/gateloop-transformer

0.0.18

10 Nov 03:15
additional swish gate for gateloop module

0.0.16

10 Nov 02:13
state transition should act on per gate loop head

0.0.15

10 Nov 01:13
increase default frac gradient for state transition projection

0.0.14

10 Nov 01:08
add an assert and encourage researchers to play around with heads

0.0.12

09 Nov 20:13
fix a misunderstanding, thanks to main author @tobiaskatsch for the d…

0.0.11

09 Nov 18:06
able to ablate state transitions

0.0.10

09 Nov 17:31
need to see something before deciding whether to invest time in cuda …

0.0.8

09 Nov 16:26
allow for training full attention with rotary + data dependent xpos s…

0.0.7

09 Nov 15:12
misunderstood how activation functions were applied

0.0.6

09 Nov 02:25
0.0.6