Issues: OpenLLMAI/OpenRLHF
Issues list
Training DPO with Deepseek-lite fails with: expected mat1 and mat2 to have the same type, but got: float != c10::BFloat16
#306 opened May 27, 2024 by victorShawFan
RM training loss becomes NaN after the first training step finishes.
#288 opened May 11, 2024 by lixsh6
Reward model dataset issue
documentation
Improvements or additions to documentation
#273 opened Apr 18, 2024 by burger-pb
Add test pipeline: use a small LLM and small data
documentation
enhancement
New feature or request
#267 opened Apr 11, 2024 by catqaq