grad_norm becomes NaN during DPO training #923

Open
rtz1998 opened this issue May 13, 2024 · 1 comment

Comments

@rtz1998

rtz1998 commented May 13, 2024

When running DPO training with Qwen1.5-7B-Chat, grad_norm becomes NaN and the model then stops updating.

  1. I tried switching dtype to fp32, but the problem still occurs.

image

@tastelikefeet
Collaborator

Training has diverged. What did you set the lr to?
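For anyone debugging a similar case, here is a minimal sketch of how a single non-finite gradient value turns the reported global grad_norm into NaN, and how to find which parameter it came from. It is plain Python rather than the trainer's actual code; the parameter names and gradient values are hypothetical, and the helpers `global_grad_norm` and `non_finite_params` are made up for illustration. In a real PyTorch run the equivalent check would be `torch.isfinite(p.grad).all()` over `model.named_parameters()`.

```python
import math


def global_grad_norm(grads):
    """L2 norm over all gradient values (the quantity usually logged as grad_norm)."""
    return math.sqrt(sum(g * g for values in grads.values() for g in values))


def non_finite_params(grads):
    """Return the names of parameters whose gradient contains NaN or inf."""
    return [
        name
        for name, values in grads.items()
        if not all(math.isfinite(g) for g in values)
    ]


# Hypothetical gradients: parameter name -> flattened gradient values.
grads = {
    "layer0.weight": [0.1, -0.2, 0.05],
    "layer1.weight": [0.3, float("nan"), 0.0],
}

norm = global_grad_norm(grads)   # one NaN entry makes the whole norm NaN
bad = non_finite_params(grads)   # names of the parameters to inspect first
print(norm, bad)
```

Knowing which parameters go non-finite first, and at which step, helps tell a divergence caused by a too-large lr apart from a problem in the data or the loss computation.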
