Fix implicit computation of max_steps when gbs < rollout size #151

Open, wants to merge 1 commit into base: main

odelalleau
Collaborator

@odelalleau odelalleau commented Apr 11, 2024

This commit also ensures this code is compatible with the change in
NVIDIA/NeMo#8744
which requires using a NeMo sampler to compute max_steps correctly. It does not work with a basic PyTorch sampler, though I will update that NeMo PR to make it work with a warning, as this made me realize it could otherwise break some user code.

Tested locally on my workstation.

This commit also ensures this code is compatible with the change in
        NVIDIA/NeMo#8744
which requires using a NeMo sampler to compute `max_steps` correctly
(it does not work with a basic PyTorch sampler).

Signed-off-by: Olivier Delalleau <507137+odelalleau@users.noreply.github.com>