[Backlog] Add sparse models to options #52

claysauruswrecks · 2023-04-12T04:06:23Z

I don't know of any right now, this is just a placeholder for people to fill in if they are aware of such options.

Here is an example of a performance increase from this pruning process: https://github.com/mlcommons/inference_results_v3.0/tree/main/open/NeuralMagic

deep-diver · 2023-04-20T17:12:43Z

Can you elaborate?

claysauruswrecks · 2023-04-20T21:53:20Z

Sure, trimming involves removing nodes and connections in the network while minimizing accuracy loss. There is also an inference performance gain in both speed and hardware requirements.

Here is one such framework for pruning models, which resulted in the benchmark mentioned above: https://github.com/neuralmagic/deepsparse

Someone is bound to prune the LLaMA derivatives, and I opened this task so others might track or see it and add theirs.

claysauruswrecks changed the title ~~Add sparse models to options~~ [Backlog]Add sparse models to options Apr 20, 2023

claysauruswrecks changed the title ~~[Backlog]Add sparse models to options~~ [Backlog] Add sparse models to options Apr 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Backlog] Add sparse models to options #52

[Backlog] Add sparse models to options #52

claysauruswrecks commented Apr 12, 2023

deep-diver commented Apr 20, 2023

claysauruswrecks commented Apr 20, 2023 •

edited

[Backlog] Add sparse models to options #52

[Backlog] Add sparse models to options #52

Comments

claysauruswrecks commented Apr 12, 2023

deep-diver commented Apr 20, 2023

claysauruswrecks commented Apr 20, 2023 • edited

claysauruswrecks commented Apr 20, 2023 •

edited