llama
Here are 960 public repositories matching this topic...
🤯 Lobe Chat - an open-source, modern-design ChatGPT/LLMs UI/Chat Framework. Supports speech-synthesis, multi-modal, and extensible plugin system. One-click FREE deployment of your private ChatGPT/Gemini/Ollama chat application.
Updated Jun 6, 2024 - TypeScript
🚀🚀🚀 A collection of awesome public projects about Large Language Models, Vision Foundation Models, and AI-Generated Content.
Updated Jun 6, 2024
🤘 TT-NN operator library and TT-Metalium low-level kernel programming model.
Updated Jun 6, 2024 - C++
NAACL '24 (Demo) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference
Updated Jun 6, 2024 - Python
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference lets you run inference with any open-source language model, speech recognition model, or multimodal model, whether in the cloud, on-premises, or on your laptop.
Updated Jun 6, 2024 - Python
A high-throughput and memory-efficient inference and serving engine for LLMs
Updated Jun 6, 2024 - Python
🤖 The free, open-source OpenAI alternative. Self-hosted, community-driven, and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers, and many more model architectures. It can generate text, audio, video, and images, and also offers voice cloning.
Updated Jun 6, 2024 - C++
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Updated Jun 6, 2024 - Python
Unify Efficient Fine-Tuning of 100+ LLMs
Updated Jun 6, 2024 - Python
🤖 A collection of practical AI repos, tools, websites, papers, and tutorials. A practical AI treasure chest 💎
Updated Jun 6, 2024 - Ruby
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
Updated Jun 6, 2024 - Go
VSCode coding companion for software teams 🦆 Turn your team insights into a portable plug-and-play context for code generation. Alternative to GitHub Copilot & OpenAI GPT powered by OSS LLMs (Phi 3, Llama 3, CodeQwen, Mistral, etc.), made with ❤️ using FastAPI & Ollama.
Updated Jun 6, 2024 - Python
Java version of LangChain
Updated Jun 6, 2024 - Java