
inference-api

Here are 51 public repositories matching this topic...

BentoML

The most flexible way to serve AI/ML models in production - build Model Inference Services, LLM APIs, Inference Graphs/Pipelines, Compound AI systems, Multi-Modal apps, RAG as a Service, and more!

  • Updated Apr 26, 2024
  • Python
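As a rough illustration of the kind of service BentoML builds, here is a minimal sketch in the style of BentoML's 1.2 service API; the `Summarization` class name and the Hugging Face `transformers` pipeline are assumptions for the example, not something stated in this listing:

```python
import bentoml
from transformers import pipeline  # assumption: transformers is installed


@bentoml.service(resources={"cpu": "2"}, traffic={"timeout": 60})
class Summarization:
    """A minimal text-summarization inference service."""

    def __init__(self) -> None:
        # Load the model once per worker at startup.
        self.pipeline = pipeline("summarization")

    @bentoml.api
    def summarize(self, text: str) -> str:
        # Each decorated method is exposed as an HTTP endpoint when served.
        result = self.pipeline(text)
        return result[0]["summary_text"]
```

Serving the file with something like `bentoml serve service:Summarization` would then expose `summarize` as a JSON-over-HTTP endpoint.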

This repository lets you get started with GUI-based training of a state-of-the-art deep learning model with little to no configuration needed! No-code training with TensorFlow has never been so easy.

  • Updated Feb 13, 2024
  • Python

This repository lets you get started with training a state-of-the-art deep learning model with little to no configuration needed! You provide your labeled dataset and can start training right away. You can even test your model with our built-in Inference REST API. Training classification models with GluonCV has never been so easy.

  • Updated May 11, 2022
  • Python

This is a repository for an image classification inference API using the GluonCV framework. The inference REST API runs on CPU or GPU and is supported on Windows and Linux operating systems. Models trained with our GluonCV classification training repository can be deployed in this API, and several models can be loaded and served at the same time.

  • Updated May 4, 2022
  • Python
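A hedged sketch of how a client might call such an image classification inference REST API using Python's `requests` library; the host, port, and `/models/predict` route are hypothetical placeholders for illustration, not the repository's documented endpoints:

```python
import requests

# Hypothetical endpoint; check the repository's API docs for the real route.
URL = "http://localhost:4343/models/predict"


def classify(image_path: str, url: str = URL) -> dict:
    """Send an image to the inference API and return the JSON prediction."""
    with open(image_path, "rb") as f:
        response = requests.post(url, files={"image": f})
    response.raise_for_status()
    return response.json()


if __name__ == "__main__":
    print(classify("sample.jpg"))
```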

An open-source framework for Retrieval-Augmented Generation (RAG) that uses semantic search to retrieve relevant results and generate a human-readable conversational response with the help of an LLM (Large Language Model).

  • Updated Mar 13, 2024
  • Python
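The retrieve-then-generate flow that description refers to can be sketched roughly as follows; this is a conceptual sketch only, and the `embed` and `llm_generate` callables are hypothetical placeholders, not APIs from this repository:

```python
from typing import Callable, Sequence

import numpy as np


def retrieve(query: str,
             documents: Sequence[str],
             embed: Callable[[str], np.ndarray],
             top_k: int = 3) -> list[str]:
    """Semantic search: rank documents by cosine similarity to the query."""
    q = embed(query)
    scores = []
    for doc in documents:
        d = embed(doc)
        scores.append(float(q @ d / (np.linalg.norm(q) * np.linalg.norm(d))))
    ranked = sorted(zip(scores, documents), reverse=True)
    return [doc for _, doc in ranked[:top_k]]


def answer(query: str,
           documents: Sequence[str],
           embed: Callable[[str], np.ndarray],
           llm_generate: Callable[[str], str]) -> str:
    """RAG: place the retrieved context into the LLM prompt and generate."""
    context = "\n".join(retrieve(query, documents, embed))
    prompt = f"Answer using only this context:\n{context}\n\nQuestion: {query}"
    return llm_generate(prompt)
```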
