I ran into a problem while training with DDP: it reported that some parameters were not involved in training. After investigating, I found that self.score_gen and self.conv are unused, which is a minor issue. More importantly, self.rate_conv never participates in the gradient calculation, because generating the mask by comparing against a threshold is not a differentiable operation.
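To illustrate why a thresholded mask passes no gradient back to the value that sets the threshold, here is a minimal framework-free sketch. It is not the AdaIR code: the function names, the toy scores and values, and the sigmoid relaxation are all assumptions made for illustration. It estimates the derivative of a masked sum with respect to the threshold by finite differences, once for a hard 0/1 mask and once for a soft (sigmoid) mask.

```python
import math

def hard_mask_output(rate, scores, values):
    """Sum of the values whose score exceeds `rate` (hard 0/1 mask)."""
    return sum(v for s, v in zip(scores, values) if s > rate)

def soft_mask_output(rate, scores, values, temperature=0.1):
    """Same sum with a sigmoid relaxation of the mask (hypothetical alternative)."""
    return sum(v / (1.0 + math.exp(-(s - rate) / temperature))
               for s, v in zip(scores, values))

def finite_difference(fn, x, eps=1e-4):
    """Central-difference estimate of d fn / d x."""
    return (fn(x + eps) - fn(x - eps)) / (2 * eps)

# Toy data, chosen so that no score sits exactly on the threshold.
scores = [0.1, 0.4, 0.7, 0.9]
values = [1.0, 2.0, 3.0, 4.0]
rate = 0.5

hard_grad = finite_difference(lambda r: hard_mask_output(r, scores, values), rate)
soft_grad = finite_difference(lambda r: soft_mask_output(r, scores, values), rate)

print("hard mask gradient w.r.t. rate:", hard_grad)
print("soft mask gradient w.r.t. rate:", soft_grad)
```

The hard mask is piecewise constant in the threshold, so a small change in the threshold leaves the output unchanged and the estimated gradient is zero. In an autograd framework the comparison also cuts the graph, so a layer that only feeds the threshold receives no gradient at all, which is consistent with DDP flagging it as unused. The sigmoid version is one common way to keep such a path differentiable; whether it suits this model is a separate question.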
self.score_gen and self.conv are not used in our models; we forgot to delete them from the code.
Thank you also for pointing out the gradient-related issue. We will look into it.
AdaIR/net/model.py
Lines 339 to 347 in 69e13fb