Is there any reason the learning rate is set 10x lower than in the original TF implementation?
Did it work with this lr and not the original?
I was wondering whether there are any underlying implementation differences in PyTorch that might make it tricky to reproduce the results.