Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Re-release consolidated OPT / OPT-IML checkpoints #625

Open
suchenzang opened this issue Feb 1, 2023 · 2 comments
Open

Re-release consolidated OPT / OPT-IML checkpoints #625

suchenzang opened this issue Feb 1, 2023 · 2 comments
Assignees
Labels
enhancement New feature or request

Comments

@suchenzang
Copy link
Contributor

After #459 and #556, we can now release updated checkpoints that are consolidated from FSDP shards with different model parallelism as well. We should update all of our checkpoints as a start to help address some of the following painpoints that users are facing:

and previous issues:

We have internal consolidated versions for 2.7B and 30B to check against, and will also need to confirm that generation looks roughly sane after consolidation.

@suchenzang suchenzang added the enhancement New feature or request label Feb 1, 2023
@EIFY
Copy link

EIFY commented Feb 2, 2023

I can work around it, but could the following issue be considered related as well?

@andrewPoulton
Copy link
Contributor

Yeah looks like it, at least tangentially - the loading logic there could probably do with simplifying. It should be possible to identify naming convention by just reading the checkpoint directory.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

6 participants