TPU VM trained weights #1186

Answered by rwightman
romanoss asked this question in Q&A

@romanoss They work the same on TPU or GPU as far as inference or fine-tuning is concerned. They are standard float32 weights, so there are no float16/bfloat16 issues to worry about. I did all training on TPU VM v3 instances, but all of the posted validation numbers were run/verified on an RTX 3090.
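
If you want to confirm this for a checkpoint you have downloaded, a rough sketch along these lines should do it. It assumes plain PyTorch; the small `nn.Sequential` model and the in-memory buffer are hypothetical stand-ins for a real model and checkpoint file, and `checkpoint_dtypes` is just an illustrative helper, not part of any library.

```python
import io

import torch
from torch import nn


def checkpoint_dtypes(state_dict):
    """Return the set of floating-point dtypes found in a state dict."""
    return {
        t.dtype
        for t in state_dict.values()
        if torch.is_tensor(t) and t.is_floating_point()
    }


def make_model():
    # Hypothetical stand-in for a real model; any nn.Module works the same way.
    return nn.Sequential(nn.Linear(4, 8), nn.ReLU(), nn.Linear(8, 2))


# Save a state dict to an in-memory buffer (stand-in for a checkpoint file).
buffer = io.BytesIO()
torch.save(make_model().state_dict(), buffer)
buffer.seek(0)

# map_location="cpu" loads the tensors onto the CPU regardless of the
# device they were saved from.
state = torch.load(buffer, map_location="cpu")

# Floating-point weights should all be float32 for a standard checkpoint.
dtypes = checkpoint_dtypes(state)
print(dtypes)

# Loading into a freshly built model reports any missing/unexpected keys.
fresh = make_model()
result = fresh.load_state_dict(state)
print(result.missing_keys, result.unexpected_keys)
```

For a real checkpoint, replace the buffer with the path to the weights file and `make_model()` with the matching architecture. If the printed set contains only `torch.float32`, no dtype conversion is needed before inference or fine-tuning.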

Answer selected by romanoss