About minimizing local_llm image size #476

hardychen1991 · 2024-04-15T09:54:07Z

Hi, thanks for such a great work!
Just wondering if someone has done something to minimizing local_llm image size?

I've tried to build my customized image for text-only SLM inference, specifically with Gemma-2B. But the amount of base images and packages is a bit overwhelming.
Any information or advices maybe? Thanks!

dusty-nv · 2024-04-15T14:03:09Z

Hi @hardychen1991, yea I feel you, have been trying to make this smaller and build faster, in fact we basically re-did most of the containers in this repo for minimization. Perhaps unsurprisingly considering what it achieves, this one has many big/complex dependencies including MLC/TVM, AWQ, FAISS, ASR/TTS, ect so it is still quite large. The local_llm has also transitioned to NanoLLM for future development, where I hope to continue making progress on issues like this:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About minimizing local_llm image size #476

About minimizing local_llm image size #476

hardychen1991 commented Apr 15, 2024 •

edited

dusty-nv commented Apr 15, 2024 •

edited

About minimizing local_llm image size #476

About minimizing local_llm image size #476

Comments

hardychen1991 commented Apr 15, 2024 • edited

dusty-nv commented Apr 15, 2024 • edited

hardychen1991 commented Apr 15, 2024 •

edited

dusty-nv commented Apr 15, 2024 •

edited