Replies: 1 comment 1 reply
-
@shunmian you can't take hparams that are tuned for imagenet and expect them to work on a task that's closer to mnist. These RMSProp settings are unlikely to work on a smaller dataset, AdamW as an optimizer will be more forgiving. Also, disable model-ema until you get some result and then enable with a much shorter time-const (like 0.99 - 0.999). |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Describe the bug
I have been trying to train hand writing digits with
efficientnet_b2
. However, the training is not running correctly.To Reproduce
The ./HWD/ folder has following structure:
When trainning finished, the log says "Best metric: 10.0 (epoch 0)"
The full training log is as following:
When I do inference with
It produce unexpected result:
What would be the possible cause of that?
The training data is here.
Beta Was this translation helpful? Give feedback.
All reactions