Add RWKV models #16

guangyusong commented Jun 5, 2023

Changes:

Added support for the RWKV model family.

Related links:

Paper: https://arxiv.org/abs/2305.13048
GitHub: https://github.com/BlinkDL/RWKV-LM

Screenshots:

Model list:
Screenshot 2023-06-05 at 2 10 45 AM

Sample generation:
Screenshot 2023-06-05 at 2 09 48 AM

salesforce-cla bot commented Jun 5, 2023

Thanks for the contribution! Before we can merge this, we need @guangyusong to sign the Salesforce Inc. Contributor License Agreement.

bdqnghi (Contributor) commented Jun 6, 2023

Thank you. The RWKV model family looks very nice. However, these are not really code-learning models. Do you have checkpoints trained for coding tasks?

guangyusong (Author)

The RWKV model family should have a data split similar to that of GPT-J. We anticipate releasing a model that is more adept at coding tasks in the near future.
