Skip to content
This repository has been archived by the owner on Jan 15, 2024. It is now read-only.

Validation and testing too slow when train Transformer #565

Answered by ymjiang
ymjiang asked this question in Q&A
Discussion options

You must be logged in to vote

Problem solved, I hard-code the optimizer to SGD in train_transformer.py for some reasons. When I use Adam instead, the loss decrease much more quickly. At final step, the validation only takes about 5 minutes. And the training speed reaches 130 kwps. Sorry about my mistake.

Replies: 9 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

ymjiang
Jan 21, 2019
Collaborator Author

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

ymjiang
Jan 21, 2019
Collaborator Author

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

ymjiang
Jan 21, 2019
Collaborator Author

You must be logged in to vote
0 replies
Comment options

ymjiang
Jan 23, 2019
Collaborator Author

You must be logged in to vote
0 replies
Answer selected by szha
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants
Converted from issue

This discussion was converted from issue #565 on August 30, 2020 19:15.