This repository has been archived by the owner on Oct 26, 2022. It is now read-only.

The gradient (Tensor.grad) of decoder weights is None #143

Open
NonvolatileMemory opened this issue Jul 5, 2020 · 1 comment
Comments

@NonvolatileMemory

Hello,
I want to get the gradients w.r.t. the parameters in the decoder, such as the embedding layer's weights and the FFN layer's weights.
However, when I run the following command, the result is always None:

print(model.decoder.layers[0].fc1.weight.grad)

and the following command always returns True, even for the FFN weights:

model.decoder.layers[0].fc1.weight.is_leaf

I don't know where it is going wrong. Thank you.
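For reference, `is_leaf` returning True is expected here: module parameters are always leaf tensors, and leaf tensors do accumulate into `.grad` once `backward()` has run through them. A None `.grad` therefore usually means no backward pass has reached that parameter yet, or the gradients were cleared with `zero_grad(set_to_none=True)`. A minimal sketch with a plain `nn.Linear` standing in for the decoder's `fc1` (not the actual model from this issue):

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for model.decoder.layers[0].fc1 in the issue.
fc1 = nn.Linear(4, 4)

print(fc1.weight.is_leaf)       # True: parameters are always leaf tensors
print(fc1.weight.grad is None)  # True: no backward pass has run yet

# After a backward pass, the leaf's .grad is populated.
loss = fc1(torch.randn(2, 4)).sum()
loss.backward()
print(fc1.weight.grad is None)  # False: backward() filled in .grad
```

If `.grad` is still None after training steps, the training loop may be calling backward on a detached graph or resetting grads to None between the backward call and the inspection point.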

@NonvolatileMemory
Author

Registering a hook works fine to get the gradient.
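The hook approach mentioned above can be sketched with `Tensor.register_hook`, which fires during `backward()` with the gradient w.r.t. that tensor. Again using a plain `nn.Linear` as a hypothetical stand-in for the decoder FFN layer:

```python
import torch
import torch.nn as nn

fc1 = nn.Linear(4, 4)  # stand-in for model.decoder.layers[0].fc1
captured = {}

# The hook receives the gradient w.r.t. fc1.weight during backward().
fc1.weight.register_hook(lambda grad: captured.setdefault("fc1_w", grad.clone()))

loss = fc1(torch.randn(2, 4)).sum()
loss.backward()

print(captured["fc1_w"].shape)  # same shape as fc1.weight
```

With a single backward pass and no prior accumulation, the captured tensor matches `fc1.weight.grad`; the hook is mainly useful when the training framework clears or never exposes `.grad` directly.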
