add log_training_trajectory option #549

Open: wants to merge 9 commits into base: opt_instruct

todpole3
Contributor

If set, evaluate and save checkpoints more frequently in the early stage of training.
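
One way to picture "more frequently in the early stage" is sketched below. This is an illustration only, not the schedule used in this PR: the diff excerpt shows just the `cfg.dataset.log_training_trajectory` flag, so the helper names `is_trajectory_step` and `should_validate_and_save`, the power-of-two spacing, and the `dense_until` cutoff are all assumptions.

```python
def is_trajectory_step(num_updates: int, dense_until: int = 1024) -> bool:
    """Hypothetical early-training schedule (an assumption, not from the PR).

    Fires at update counts that are powers of two (1, 2, 4, ...) up to
    `dense_until`, so checkpoints are dense early and stop afterwards.
    """
    if num_updates <= 0 or num_updates > dense_until:
        return False
    # A positive integer is a power of two iff it has exactly one bit set.
    return num_updates & (num_updates - 1) == 0


def should_validate_and_save(
    num_updates: int,
    regular_interval: int,
    log_training_trajectory: bool,
) -> bool:
    """Combine the regular interval with the optional early-stage schedule."""
    regular = (
        regular_interval > 0
        and num_updates > 0
        and num_updates % regular_interval == 0
    )
    early = log_training_trajectory and is_trajectory_step(num_updates)
    return regular or early


if __name__ == "__main__":
    # Update counts in 1..19 that would trigger validation and saving
    # with a regular interval of 10 and the trajectory flag enabled.
    print([n for n in range(1, 20) if should_validate_and_save(n, 10, True)])
```

With the flag off, only the regular interval triggers, so existing runs would keep their current behaviour under this sketch.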

todpole3 force-pushed the opt_instruct_training_trajectory branch from 5cd7183 to 54b3e87 on December 11, 2022 10:50
@@ -418,6 +418,10 @@ def validate_and_save(
and num_updates >= cfg.dataset.validate_after_updates
and was_successful_step
)
or (
cfg.dataset.log_training_trajectory
Contributor

Why do we need to save? Can we just validate?

Contributor Author

todpole3, Dec 12, 2022

So that we can also plot the trajectory for generation tasks (ROUGE); eval will be fast.

3 participants