Hi, thank you for sharing your work. I have a question about the parameter settings. During training, you set the mini-batch size to 1 and the number of iterations to 6e5. Is there any theory or experiment showing that this setting is better than a larger mini-batch size with fewer iterations?