A high-throughput and memory-efficient inference and serving engine for LLMs
High-efficiency floating-point neural network inference operators for mobile, server, and Web
AICI: Prompts as (Wasm) Programs
A high-performance inference system for large language models, designed for production environments.
Port of OpenAI's Whisper model in C/C++
Cross-platform, customizable ML solutions for live and streaming media.
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Disparity Proxies and Social Determinants of Health
A universal scalable machine learning model deployment solution
Woodwork is a Python library that provides robust methods for managing and communicating data typing information.
Seamlessly integrate with top LLM APIs for speedy, robust, and scalable querying. Ideal for developers needing quick, reliable AI-powered responses.
Making large AI models cheaper, faster, and more accessible
🔮 SuperDuperDB: Bring AI to your database! Build, deploy, and manage any AI application directly on your existing data infrastructure, without moving your data — including streaming inference, scalable model training, and vector search.
Large Language Model Text Generation Inference
A Rust wrapper for ONNX Runtime
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.