You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Aug 3, 2021. It is now read-only.
I used my data abt 1000h speech and trained on 8x Tesla V100-16GB using horovod and mixed precision. With batch_size_per_gpu equals to 32, the training time is abt 3.2s per step, it took 3.5h for 1 epoch. Is it expected? Can I reduce the training time?
The text was updated successfully, but these errors were encountered:
Sign up for freeto subscribe to this conversation on GitHub.
Already have an account?
Sign in.
I used my data abt 1000h speech and trained on 8x Tesla V100-16GB using horovod and mixed precision. With batch_size_per_gpu equals to 32, the training time is abt 3.2s per step, it took 3.5h for 1 epoch. Is it expected? Can I reduce the training time?
The text was updated successfully, but these errors were encountered: