Skip to content

Long step time (0.1s) for ResNet-18 training on Imagenet #1542

Answered by rwightman
zaccharieramzi asked this question in Q&A
Discussion options

You must be logged in to vote

@zaccharieramzi average data time still looks fairly high (number in brackets is the avg_, looks like there's a pretty slow start for the system and the avg throughput is slowly increase towards approx 1200im/sec, not sure what sort of machine it is, but the persistent SSD disks on typical cloud instances aren't very fast. A resnet18, even in float32 should be a lot faster than that, yes.

I use TFDS for imagenet in most shared drive / cloud scenarios and only use raw folder/file datasets on local machines.

Replies: 2 comments 8 replies

Comment options

You must be logged in to vote
5 replies
@zaccharieramzi
Comment options

@rwightman
Comment options

@rwightman
Comment options

@zaccharieramzi
Comment options

@zaccharieramzi
Comment options

Answer selected by zaccharieramzi
Comment options

You must be logged in to vote
3 replies
@zaccharieramzi
Comment options

@zaccharieramzi
Comment options

@zaccharieramzi
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants