3 Billion Model is throwing error #3

Open
okpatil4u opened this issue Mar 3, 2023 · 2 comments

Comments

@okpatil4u

The 1.5-billion-parameter model works as intended, but the 3-billion-parameter model fails to convert on my system.

(mps) ➜  verbaflow git:(main) ✗ ./verbaflow -model-dir models/nlpodyssey/rwkv-4-pile-3b-instruct download
(mps) ➜  verbaflow git:(main) ✗ ./verbaflow -model-dir models/nlpodyssey/rwkv-4-pile-3b-instruct convert
5:28PM FTL error="model conversion failed: failed to convert embeddings: only BFloat16Storage is supported, actual *pytorch.FloatStorage"

Similarly, the 1.5B model works well with the original BlinkDL/RWKV-LM code, but the 3B model throws a dimension mismatch error. Is the current 3B model working correctly on your end?

@matteo-grella
Member

Thank you @okpatil4u for giving the 3B model a try. It appears that the 3B model has an extra conversion that renders it incompatible with the current converter. Rest assured that I will be uploading a revised version of the 3B model within the week. If you require it sooner, please let me know, and I can provide you with straightforward instructions on how to do it yourself.

May I ask what architecture you are using? I'm interested in hearing about the performance of the 1B5 version with your setup.

@okpatil4u
Author

Thanks Matteo, no need to hurry. I am using a Mac with an M1 Max. I have been trying to get RWKV working on Core ML, but it seems that RNNs are not supported. It was interesting to see Verbaflow built in Go, though.

This is a pretty cool project!
