intel-analytics
Pinned
Repositories
- ipex-llm Public
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max). A PyTorch LLM library that seamlessly integrates with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, etc.
-
- text-generation-webui Public Forked from oobabooga/text-generation-webui
A Gradio Web UI for running local LLM on Intel GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) using IPEX-LLM.
- llama_index Public Forked from run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
- Langchain-Chatchat Public Forked from chatchat-space/Langchain-Chatchat
Knowledge Base QA using RAG pipeline on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) with IPEX-LLM
- private-gpt Public Forked from zylon-ai/private-gpt
Interact with your documents using the power of GPT, 100% privately, no data leaks
-
- FastChat Public Forked from lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
- ipex-llm-tutorial Public
Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm