
Support GGUF #365

Open
philpax opened this issue Jul 10, 2023 · 4 comments · Fixed by #412 · May be fixed by #442
Labels
app:cli App: the `llm` CLI issue:enhancement New feature or request

Comments

@philpax
Collaborator

philpax commented Jul 10, 2023

GGUF is the new file format specification we've been designing to solve the problem of model files not being self-identifying. The specification is here: ggerganov/ggml#302

llm should be able to do the following:

  • continue supporting existing models (i.e. this change should be non-destructive)
  • load GGUF models and automatically dispatch to the correct model.
    • load_dynamic already has an interface that should support this, but loading currently only begins after the model arch is known
    • use the new information stored within the metadata to improve the UX, including automatically using the HF tokenizer if available
  • save GGUF models, especially when quantizing
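To illustrate the dispatch point above: GGUF stores the model architecture as a string under the `general.architecture` metadata key, so a loader can pick the right implementation before any tensors are read. The sketch below is hypothetical and not `llm`'s actual `load_dynamic` API; the enum and variant names are illustrative assumptions.

```rust
// Hypothetical sketch of architecture dispatch from GGUF metadata.
// The `general.architecture` key is from the GGUF spec; the enum and
// its variants are illustrative, not llm's real types.

#[derive(Debug, PartialEq)]
enum ModelArchitecture {
    Llama,
    Gpt2,
    GptNeoX,
}

fn architecture_from_metadata(arch: &str) -> Option<ModelArchitecture> {
    // GGUF stores the architecture as a string, so loading can begin
    // before the concrete model type is known.
    match arch {
        "llama" => Some(ModelArchitecture::Llama),
        "gpt2" => Some(ModelArchitecture::Gpt2),
        "gptneox" => Some(ModelArchitecture::GptNeoX),
        _ => None,
    }
}

fn main() {
    assert_eq!(
        architecture_from_metadata("llama"),
        Some(ModelArchitecture::Llama)
    );
    // Unknown architectures surface as None rather than a hard error,
    // leaving the caller free to fall back or report a useful message.
    assert_eq!(architecture_from_metadata("mystery"), None);
    println!("dispatch ok");
}
```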

llm could do the following:

  • convert old models to GGUF, prompting the user for missing metadata
  • implement the migration tool mentioned in the spec, which does autonomous conversion for users based on hashes
@philpax philpax added issue:enhancement New feature or request app:cli App: the `llm` CLI labels Jul 10, 2023
@philpax philpax mentioned this issue Aug 20, 2023
@EwoutH

EwoutH commented Sep 8, 2023

To give an update on the state of GGUF: in mid-August, GGUF was merged into llama.cpp (ggerganov/llama.cpp#2398 (comment)). Its full specification can be found here.

Recap of what GGUF is:

  • binary file format for storing models for inference
  • designed for fast loading and saving of models
  • easy to use (with a few lines of code)
  • mmap (memory mapping) compatibility: models can be loaded using mmap for fast loading and saving.
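As a concrete illustration of the binary format described above: per the spec, a GGUF file opens with the magic bytes `GGUF`, followed (in version 2 and later) by a little-endian `u32` version, a `u64` tensor count, and a `u64` metadata key-value count. The sketch below parses just that fixed-size header from a byte slice; it assumes version >= 2 field widths and is not `llm`'s real loader.

```rust
// Minimal sketch of reading a GGUF header, following the spec's layout:
// magic "GGUF", then u32 version, u64 tensor count, u64 metadata KV count.
// Assumes version >= 2 (version 1 used u32 counts); illustrative only.

use std::convert::TryInto;

#[derive(Debug, PartialEq)]
struct GgufHeader {
    version: u32,
    tensor_count: u64,
    metadata_kv_count: u64,
}

fn parse_gguf_header(bytes: &[u8]) -> Result<GgufHeader, String> {
    if bytes.len() < 24 {
        return Err("header too short".into());
    }
    if &bytes[0..4] != b"GGUF" {
        return Err("bad magic".into());
    }
    // All scalar fields are little-endian per the spec.
    let version = u32::from_le_bytes(bytes[4..8].try_into().unwrap());
    let tensor_count = u64::from_le_bytes(bytes[8..16].try_into().unwrap());
    let metadata_kv_count = u64::from_le_bytes(bytes[16..24].try_into().unwrap());
    Ok(GgufHeader { version, tensor_count, metadata_kv_count })
}

fn main() {
    // Build a synthetic 24-byte header: version 2, 3 tensors, 5 metadata KVs.
    let mut buf = Vec::new();
    buf.extend_from_slice(b"GGUF");
    buf.extend_from_slice(&2u32.to_le_bytes());
    buf.extend_from_slice(&3u64.to_le_bytes());
    buf.extend_from_slice(&5u64.to_le_bytes());

    let header = parse_gguf_header(&buf).unwrap();
    assert_eq!(header.version, 2);
    assert_eq!(header.tensor_count, 3);
    assert_eq!(header.metadata_kv_count, 5);
    println!("header ok");
}
```

The metadata key-value pairs (including `general.architecture` and tokenizer data) follow immediately after this header in the file.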

@philpax philpax pinned this issue Sep 19, 2023
@pixelspark
Contributor

Hi all, any updates on this?

@philpax
Collaborator Author

philpax commented Oct 2, 2023

Hi - sorry about the lack of updates. I've been extremely busy for the last ~two months and haven't had much free time to work on llm. I'm hoping this will ease up soon so we can start catching up properly.

@philpax philpax linked a pull request Nov 12, 2023 that will close this issue
@Dipeshpal

Any updates on gguf?
