v0.0.25
Performance
- Significantly improve the efficiency of
delu.nn.NLinear
for cases where batch size is greater than 1. The larger the input dimensions -- the larger the speedup. Since the computation algorithm is updated, the results can be slightly different with the new version (the underlying "math" is totally the same).