[Question] Scale down futher to support IOT usecases? #50

kinchahoy · 2024-02-28T07:36:04Z

Question

I'm trying to see what can run on an 8GB Raspberry Pi 5, and it occours to me that your approach might scale down really well. Any tips for replicating what you did with something like TinyLlama or trying for an 8 bit quantization of LlaVA-Phi? I'd love to try training some sort of student model as an experiment from the more successful models you've trained.

kinchahoy · 2024-02-29T22:18:07Z

For what it's worth 4 bit quantizations of LLaVA 1.6 work quite well even in the limited context of a Raspberry Pi. I'll try quantizing MOE-LLaVa soon. Let me know if this is interesting.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question] Scale down futher to support IOT usecases? #50

[Question] Scale down futher to support IOT usecases? #50

kinchahoy commented Feb 28, 2024

kinchahoy commented Feb 29, 2024

[Question] Scale down futher to support IOT usecases? #50

[Question] Scale down futher to support IOT usecases? #50

Comments

kinchahoy commented Feb 28, 2024

Question

kinchahoy commented Feb 29, 2024