Support GGUF #365
To give an update on the state of GGUF: in mid-August, GGUF was merged into llama.cpp (ggerganov/llama.cpp#2398 (comment)). Its full specification can be found here. Recap of what GGUF is:
Hi all, any updates on this?
Hi, sorry about the lack of updates. I've been extremely busy for the last ~two months and haven't had much free time to work on
Any updates on GGUF?
GGUF is the new file format specification we've been designing to solve the problem of not being able to identify a model from its file. The specification is here: ggerganov/ggml#302
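For illustration, here is a minimal sketch of how a loader might confirm that a file is GGUF before anything about the model is known. It assumes the header layout from the spec: the four magic bytes `GGUF` followed by a little-endian `u32` version. `peek_gguf_version` is a hypothetical helper, not an existing function in `llm`.

```rust
/// Hypothetical helper (not part of `llm`): returns the GGUF version if
/// `bytes` begins with a GGUF header, or `None` otherwise.
/// Assumes the magic bytes `GGUF` followed by a little-endian u32 version.
pub fn peek_gguf_version(bytes: &[u8]) -> Option<u32> {
    // Reject anything that does not start with the magic bytes.
    if !bytes.starts_with(b"GGUF") {
        return None;
    }
    // The version occupies the next four bytes; bail out if they are missing.
    let v = bytes.get(4..8)?;
    Some(u32::from_le_bytes([v[0], v[1], v[2], v[3]]))
}
```

A loader could call something like this on the first eight bytes of a file and only then go on to read the metadata that identifies the model.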
`llm` should be able to do the following:
`load_dynamic` already has an interface that should support this, but loading currently only begins after the model arch is known
`llm` could do the following: