🥷 Run AI-agents with an API
Updated May 23, 2024 - TypeScript
A high-throughput and memory-efficient inference and serving engine for LLMs
Build high-quality LLM apps - from prototyping and testing to production deployment and monitoring.
Text Embedding for Retrieval, Rerank and RAG
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
A collection of prompts to challenge the reasoning abilities of large language models
🤖 The free, open-source OpenAI alternative. Self-hosted, community-driven, and local-first. A drop-in replacement for OpenAI that runs on consumer-grade hardware; no GPU required. Runs gguf, transformers, diffusers, and many more model architectures, and generates text, audio, video, and images, with voice-cloning capabilities.
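Several of the projects listed here expose an OpenAI-compatible HTTP API, meaning an existing client only needs its base URL pointed at the local server. A minimal TypeScript sketch of that idea, assuming a hypothetical local server at `http://localhost:8080` and a placeholder model name (neither is specified in the listings above):

```typescript
// Build a chat-completions request for an OpenAI-compatible local server.
// The /v1/chat/completions path follows the OpenAI wire format; the host,
// port, and model name are illustrative assumptions, not from the source.
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

function buildChatRequest(
  baseUrl: string,
  model: string,
  messages: ChatMessage[]
) {
  return {
    url: `${baseUrl}/v1/chat/completions`,
    init: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({ model, messages }),
    },
  };
}

// Pointing an unmodified OpenAI-style client at a local server is just a
// base-URL swap; an actual call would be `await fetch(req.url, req.init)`.
const req = buildChatRequest("http://localhost:8080", "local-model", [
  { role: "user", content: "Hello" },
]);
console.log(req.url);
```

Because the wire format is shared, the same request body works against any of the OpenAI-compatible engines above; only the base URL and model identifier change.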
A cloud-native vector database and storage layer for next-generation AI applications
Learn how to run Ollama in GitHub Codespaces for free
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.
Too Long, Didn't Watch (TL/DW): Your Personal Research Multi-Tool
🔍 AI search engine - self-host with local or cloud LLMs
Drop-in, local AI alternative to the OpenAI stack. Multi-engine (llama.cpp, TensorRT-LLM). Powers 👋 Jan
📰 Must-read papers and blogs on LLM-based Long Context Modeling 🔥
An NLQ (Natural Language Query) demo using Amazon Bedrock and Amazon OpenSearch with the RAG technique.
An LLM client for use from the command line or an IDE