Skip to content

Issues: NVIDIA/NeMo-Aligner

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Assignee
Filter by who’s assigned
Sort

Issues list

Can you please support context parallel?
#162 opened Apr 23, 2024 by DZ9
SFT does not work max_steps bug Something isn't working
#159 opened Apr 18, 2024 by AtsunoriFujita
Can you support KTO?
#143 opened Apr 7, 2024 by lifan-yuan
cannot load reward model from SFT model because of missing keys bug Something isn't working
#137 opened Apr 1, 2024 by DZ9
SFT is broken with container 24.01.01 bug Something isn't working
#131 opened Mar 22, 2024 by odelalleau
SFT may crash if input data exceeds the context length bug Something isn't working
#127 opened Mar 15, 2024 by odelalleau
random samplers keeps state bug Something isn't working
#107 opened Feb 14, 2024 by gshennvm
Changing num_rollout_samples modifies the validation set in PPO bug Something isn't working
#90 opened Jan 23, 2024 by odelalleau
Implement the KTO algorithm
#60 opened Dec 15, 2023 by odelalleau
Add the SFT tutorial
#59 opened Dec 15, 2023 by shengyangs
code cleanup proposal
#54 opened Dec 9, 2023 by gshennvm
ProTip! Updated in the last three days: updated:>2024-05-10.