Pretrain learning rate is 1e-8? #119

Open
5 tasks done
hegang1-tal opened this issue Aug 12, 2023 · 0 comments
Labels
question Further information is requested

Required prerequisites

Questions

Hi, I found that the learning rate in the DeepSpeed config file is 1e-8. I am wondering whether this is too small for pretraining?
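
To make the question concrete, here is a minimal sketch of how one might list every learning-rate-like field that a DeepSpeed-style JSON config sets. The config contents and all values below are made up for illustration and are not copied from this project's file. `find_lr_fields` is a hypothetical helper, not part of DeepSpeed. The `optimizer.params.lr` and `scheduler.params.warmup_*_lr` keys follow the layout described in the DeepSpeed configuration documentation.

```python
import json


def find_lr_fields(node, path=""):
    """Recursively collect (path, value) pairs for keys that look like learning rates.

    Hypothetical helper: a key counts as learning-rate-like if it is "lr",
    ends with "_lr", or contains "learning_rate".
    """
    found = []
    if isinstance(node, dict):
        for key, value in node.items():
            child = f"{path}.{key}" if path else key
            if isinstance(value, (dict, list)):
                found.extend(find_lr_fields(value, child))
            elif key == "lr" or key.endswith("_lr") or "learning_rate" in key:
                found.append((child, value))
    elif isinstance(node, list):
        for index, item in enumerate(node):
            found.extend(find_lr_fields(item, f"{path}[{index}]"))
    return found


# Illustrative config only (assumed values, NOT this repository's actual file).
EXAMPLE_CONFIG = json.loads("""
{
  "optimizer": {"type": "AdamW", "params": {"lr": 1e-8}},
  "scheduler": {
    "type": "WarmupLR",
    "params": {"warmup_min_lr": 0, "warmup_max_lr": 1e-4, "warmup_num_steps": 100}
  }
}
""")

lr_fields = find_lr_fields(EXAMPLE_CONFIG)
for field_path, field_value in lr_fields:
    print(f"{field_path} = {field_value}")
```

Listing the fields this way only shows what the JSON file itself contains. Whether the training script overrides any of them at launch is a separate question that the config alone cannot answer.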

Checklist

  • I have provided all relevant and necessary information above.
  • I have chosen a suitable title for this issue.

hegang1-tal added the question label (Further information is requested) on Aug 12, 2023