Validation step uses 2x memory and 12x compute time #558

Closed Answered by pascal-pfeiffer
mashdragon asked this question in Q&A

You can sample the validation set to reduce the time further, or create a custom validation dataset with only a few samples.
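As a rough illustration of the second option, here is a minimal sketch of building a small validation subset yourself before uploading it. It assumes the validation data is already loaded as a Python list of records; the helper name `sample_validation` and its `fraction` and `seed` parameters are hypothetical and not part of any tool's API.

```python
import random


def sample_validation(rows, fraction=0.1, seed=0):
    """Return a reproducible random subset of `rows` (hypothetical helper).

    rows:     list of validation records (any type)
    fraction: share of records to keep, in (0, 1]
    seed:     fixed seed so every run evaluates the same subset
    """
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    if not rows:
        return []
    # Keep at least one record so validation never runs on an empty set.
    k = max(1, round(len(rows) * fraction))
    rng = random.Random(seed)
    return rng.sample(rows, k)


# Example: keep 5% of a 200-record validation set.
subset = sample_validation(list(range(200)), fraction=0.05)
```

Fixing the seed keeps the subset identical between runs, so validation scores stay comparable across experiments.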

Validation metrics that rely on generating new output (BLEU and GPT) are expected to be slower than training. Perplexity should run at about the same speed as training.
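To see why perplexity is cheap, note that under its standard definition it only needs the log-probabilities the model assigns to the reference tokens, which a single forward pass provides, whereas BLEU and GPT scoring require generating text token by token. The sketch below uses that textbook definition; the function name is hypothetical and this is not necessarily how the tool computes it internally.

```python
import math


def perplexity(token_log_probs):
    """Perplexity = exp(mean negative log-likelihood) over reference tokens.

    token_log_probs: natural-log probabilities the model assigned to each
    reference token (assumed to come from one forward pass, no generation).
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


# A model that gives every reference token probability 0.25
# has a perplexity of about 4.
ppl = perplexity([math.log(0.25)] * 4)
```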

Answer selected by mashdragon