Training reproduce #41

Open
ChaoGaoUCR opened this issue Sep 20, 2023 · 2 comments

@ChaoGaoUCR

Dear Authors,

Thanks again for the great work.
I have a quick question.
I tried to train for 4 epochs by setting the trainer's epoch count to 1 and repeating the run four times in a for loop.
I can't reproduce the same results this way.
Any hint as to what I did wrong?
image

Thanks

@HZQ950419
Collaborator

Hi,

Did you resume from the previous epoch's checkpoint? If so, please make sure every epoch trains from scratch. If not, you can set a random seed to improve the reproducibility of the results. Could you report the results you got? I'd like to know how much they vary. Thanks!
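For readers following along, here is a minimal sketch of what setting a random seed before each run could look like. The helper name `set_seed` and the seed value are illustrative assumptions, not code from this repository; NumPy and PyTorch are seeded only if they happen to be installed.

```python
import random


def set_seed(seed: int) -> None:
    """Seed the random number generators available in this environment.

    Hypothetical helper for illustration; not part of the project's code.
    """
    random.seed(seed)
    try:
        import numpy as np
        np.random.seed(seed)
    except ImportError:
        pass  # NumPy not installed; nothing to seed
    try:
        import torch
        torch.manual_seed(seed)           # seeds the CPU generator
        torch.cuda.manual_seed_all(seed)  # silently ignored when CUDA is unavailable
    except ImportError:
        pass  # PyTorch not installed; nothing to seed


# Re-seeding before each repeated run makes the runs start from the same random state.
draws = []
for run in range(4):
    set_seed(42)
    draws.append(random.random())  # stand-in for "launch one training run here"

print(len(set(draws)) == 1)
```

Seeding alone does not guarantee bit-identical results on GPU, since some kernels are nondeterministic, but it removes the most common source of run-to-run variation.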

@ChaoGaoUCR
Author

Dear,

Thanks, it was my fault: I set the batch size too large (512).
After changing the batch size to 16, it works perfectly.

Best
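The thread does not say why the larger batch size hurt the results. One plausible factor, offered here only as an assumption, is the number of optimizer updates: with the learning rate unchanged, a batch size of 512 yields far fewer updates per epoch than a batch size of 16. The sketch below does the arithmetic for a hypothetical dataset size and assumes no gradient accumulation and that the last partial batch is kept.

```python
import math


def optimizer_steps(num_examples: int, batch_size: int, epochs: int) -> int:
    """Number of optimizer updates, assuming one update per batch
    (no gradient accumulation) and that the final partial batch is kept."""
    return math.ceil(num_examples / batch_size) * epochs


NUM_EXAMPLES = 10_000  # hypothetical dataset size, not taken from the thread
EPOCHS = 4             # the epoch count mentioned in the question

for batch_size in (16, 512):
    steps = optimizer_steps(NUM_EXAMPLES, batch_size, EPOCHS)
    print(f"batch_size={batch_size}: {steps} updates")
# batch_size=16: 2500 updates
# batch_size=512: 80 updates
```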
