-
Notifications
You must be signed in to change notification settings - Fork 298
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DPO训练的时候grad_norm出现nan值 #923
Comments
跑飞了,lr怎么设置的 |
我也是跑飞了 qwen14b 8xA100 全参数 微调 dpo 调整过精度,还有什么推荐的修改方案么?感谢大佬们 日志:
脚本:
ds设置
|
lr 调整 还是会有Nan 由
|
去掉deepspeed会不会有作用 |
已经解决了不,拉取一下main分支 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
使用Qwen1.5-7B-Chat在dpo训练的时候出现grad_norm出现Nan值,然后模型不更新
The text was updated successfully, but these errors were encountered: