Releases: lucidrains/gateloop-transformer
Releases · lucidrains/gateloop-transformer
0.0.5
converges with softmax normalization, but not noticeably better
0.0.4
full attention with data dependent rel pos did not converge
0.0.3
prepare gate loop transformer for experiments
0.0.2
missing interleave for associative scan, fix axis
0.0.1
gate looped attention complete