A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
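To illustrate the kind of HTTP API such a computer-vision inference server typically exposes, here is a minimal client sketch; the port, route, and response shape are hypothetical placeholders, not any specific server's contract.

```python
# Minimal sketch of a client for an HTTP computer-vision inference server.
# The URL, route, and JSON response shape are hypothetical placeholders;
# consult the specific server's documentation for its actual API.
import requests

SERVER_URL = "http://localhost:9001/infer/object-detection"  # hypothetical

with open("image.jpg", "rb") as f:
    resp = requests.post(SERVER_URL, files={"image": f}, timeout=30)
resp.raise_for_status()

for pred in resp.json().get("predictions", []):  # hypothetical response shape
    print(pred)
```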
The simplest way to serve AI/ML models in production
The Qualcomm® AI Hub apps are a collection of state-of-the-art machine learning models optimized for performance (latency, memory, etc.) and ready to deploy on Qualcomm® devices.
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory, etc.) and ready to deploy on Qualcomm® devices.
🔀 Bedrock Proxy Endpoint ⇢ Spin up your own custom OpenAI API server endpoint for easy AWS Bedrock inference (using the standard baseUrl and apiKey params)
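A sketch of what that looks like from the client side, using the official openai Python package pointed at such a proxy; the proxy URL, key, and Bedrock model ID below are placeholder assumptions.

```python
# Sketch of calling an OpenAI-compatible proxy in front of AWS Bedrock.
# The proxy URL, API key, and model ID are hypothetical placeholders.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",  # your proxy endpoint (hypothetical)
    api_key="my-proxy-key",               # whatever key the proxy expects
)

reply = client.chat.completions.create(
    model="anthropic.claude-3-sonnet-20240229-v1:0",  # example Bedrock model ID
    messages=[{"role": "user", "content": "Hello from Bedrock via a proxy!"}],
)
print(reply.choices[0].message.content)
```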
Unofficial Go (Golang) bindings for the Hugging Face Inference API
Eternal is an experimental platform for machine learning models and workflows.
The small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly
Text-to-image generation with the Stable Diffusion XL model, powered by the Hugging Face Inference API
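For reference, a minimal sketch of that idea using the official huggingface_hub client; it assumes a valid Hugging Face token is configured (e.g. via the HF_TOKEN environment variable).

```python
# Sketch of text-to-image with SDXL via the Hugging Face Inference API,
# using the official huggingface_hub client.
from huggingface_hub import InferenceClient

client = InferenceClient(model="stabilityai/stable-diffusion-xl-base-1.0")
image = client.text_to_image("A watercolor fox in a snowy forest")  # PIL Image
image.save("fox.png")
```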
Text components powering LLMs & SLMs for the geniusrise framework
Train and predict with pre-trained deep learning models through the GUI (web app). No more endless parameters, no more data preprocessing.
An open-source framework for Retrieval-Augmented Generation (RAG) that uses semantic search to retrieve the expected results and generate a human-readable conversational response with the help of an LLM (Large Language Model).
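A minimal sketch of that RAG flow: embed documents, semantically match the query, and pass the best hit to an LLM as context. The embedding model choice and the final generate call are assumptions for illustration, not this framework's API.

```python
# Minimal RAG sketch: semantic search over embedded documents, then an
# LLM prompt built from the best-matching context.
import numpy as np
from sentence_transformers import SentenceTransformer

docs = [
    "Our return policy allows refunds within 30 days.",
    "Shipping takes 3-5 business days within the EU.",
]
encoder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model
doc_vecs = encoder.encode(docs, normalize_embeddings=True)

query = "How long do deliveries take?"
q_vec = encoder.encode([query], normalize_embeddings=True)[0]
best = docs[int(np.argmax(doc_vecs @ q_vec))]  # cosine similarity via dot product

prompt = f"Answer using only this context:\n{best}\n\nQuestion: {query}"
# response = llm.generate(prompt)  # hypothetical LLM call; plug in your client
print(prompt)
```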
A tool for testing different large language models without writing code.
MLOps library for LLM deployment with the vLLM engine on RunPod's infrastructure.
Background Removal and Replacement API built using the Sanic Framework
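As a sketch of how such an endpoint might be structured in Sanic; the route and the remove_background helper are hypothetical stand-ins for a real segmentation model.

```python
# Sketch of a background-removal endpoint in Sanic. The route name and the
# remove_background helper are hypothetical; a real implementation would run
# a segmentation/matting model on the uploaded image.
from sanic import Sanic, response

app = Sanic("BackgroundRemovalAPI")

def remove_background(image_bytes: bytes) -> bytes:
    # Hypothetical placeholder: a real service would apply a segmentation
    # model here and return the processed PNG bytes.
    return image_bytes

@app.post("/remove-bg")
async def remove_bg(request):
    upload = request.files.get("image")  # multipart form field named "image"
    if upload is None:
        return response.json({"error": "missing 'image' file"}, status=400)
    return response.raw(remove_background(upload.body), content_type="image/png")

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=8000)
```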
An unofficial Hugging Face REST client for Unity (UPM)