Nolano.ai
Compressing Foundation models for deployment on clouds, phones and laptops
Popular repositories
- sparse_quant_llms Public
SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia
- InstructLLaMa.cpp Public
Fast inference of Instruct tuned LLaMa on your personal devices.
Repositories
Showing 9 of 9 repositories
- react-native-nolano-sdk Public
React Native SDK for building locally running LLMs applications on the phone
- llama-int4-quant Public archive