
Has anyone tried adopting ffcv in timm to accelerate training? #1161

Answered by rwightman
Doraemonzm asked this question in Q&A

@Doraemonzm Pillow-SIMD can be a pain to keep up to date (because of breaking package dependency naming), but I always install it in my training environments. It significantly reduces CPU use, which matters when CPU is the constraint (and it often is, especially with recent GPUs like A100s). Many cloud A100 instances are rather underpowered in the CPU department (in my opinion), so keeping data preprocessing efficient matters, and that's why FFCV can have an impact.

Don't overdo the -j arg: 8 train processes * 16 worker processes = 128 workers + 8 train. I doubt you have 136 physical cores, so you're just causing contention. For 8 GPUs on the same machine you likely want somewhere between 4 and 8 workers per GPU.
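To make the arithmetic above concrete, here is a rough sketch of the process count and one possible way to pick a per-GPU worker count. The function names and the "reserve one core per train process, then clamp to 4-8" rule are illustrative assumptions, not anything timm does for you, and the core count you pass in should be physical cores (note that `os.cpu_count()` reports logical CPUs).

```python
def total_processes(num_gpus: int, workers_per_gpu: int) -> int:
    """One train process per GPU plus that process's data-loading workers."""
    return num_gpus * workers_per_gpu + num_gpus


def clamp_workers_per_gpu(cores: int, num_gpus: int, low: int = 4, high: int = 8) -> int:
    """Hypothetical heuristic: reserve one core per train process, split the
    rest evenly across GPUs, then clamp to the suggested 4-8 range.
    If the machine has very few cores, the lower bound can still oversubscribe."""
    per_gpu = (cores - num_gpus) // num_gpus
    return max(low, min(high, per_gpu))


# The case from the question: 8 GPUs with -j 16
print(total_processes(8, 16))        # 136

# Assumed example machine with 64 physical cores and 8 GPUs
print(clamp_workers_per_gpu(64, 8))  # 7
```

Treat the result as a starting point for the -j value and adjust based on what CPU utilization actually looks like during training.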

If htop or whatever sy…

Replies: 1 comment 1 reply


Answer selected by Doraemonzm