Triton Inference Server with Python backend and transformers
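As a hedged illustration of that pattern, here is a minimal sketch of what a Triton Python backend model.py wrapping a transformers pipeline can look like; the tensor names ("TEXT", "LABELS") and the checkpoint are illustrative assumptions, not taken from any listed repository.

```python
# model.py -- minimal sketch of a Triton Python backend wrapping a
# transformers pipeline. Tensor names and the checkpoint are illustrative.
import numpy as np
import triton_python_backend_utils as pb_utils
from transformers import pipeline


class TritonPythonModel:
    def initialize(self, args):
        # Load the Hugging Face pipeline once, when Triton loads the model.
        self.pipe = pipeline(
            "sentiment-analysis",
            model="distilbert-base-uncased-finetuned-sst-2-english",
        )

    def execute(self, requests):
        responses = []
        for request in requests:
            # Input tensor "TEXT" holds UTF-8 encoded strings.
            texts = pb_utils.get_input_tensor_by_name(request, "TEXT").as_numpy()
            results = self.pipe([t.decode("utf-8") for t in texts.flatten()])
            labels = np.array([r["label"] for r in results], dtype=object)
            out = pb_utils.Tensor("LABELS", labels)
            responses.append(pb_utils.InferenceResponse(output_tensors=[out]))
        return responses
```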
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
Cassandra plugin for NVIDIA DALI
Tiny configuration for Triton Inference Server
The Sumen model integrates with Triton Inference Server
An alternative to Triton Inference Server that boosts DL service throughput 1.5-4x through ensemble pipeline serving with concurrent CUDA streams, supporting PyTorch/LibTorch frontends and TensorRT, CVCUDA, and other backends.
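The concurrent-streams idea behind that throughput claim can be sketched in a few lines of PyTorch; the toy models and shapes below are assumptions chosen only to show two independent forward passes overlapping on separate CUDA streams.

```python
# Toy sketch of concurrent CUDA streams in PyTorch: two independent
# forward passes are issued on separate streams so the GPU can overlap
# them. The models and input shapes are illustrative only.
import torch

model_a = torch.nn.Linear(1024, 1024).cuda().eval()
model_b = torch.nn.Linear(1024, 1024).cuda().eval()
x_a = torch.randn(64, 1024, device="cuda")
x_b = torch.randn(64, 1024, device="cuda")

stream_a = torch.cuda.Stream()
stream_b = torch.cuda.Stream()
# Make both streams wait for the default stream that created the inputs.
stream_a.wait_stream(torch.cuda.current_stream())
stream_b.wait_stream(torch.cuda.current_stream())

with torch.no_grad():
    with torch.cuda.stream(stream_a):
        y_a = model_a(x_a)          # enqueued on stream_a
    with torch.cuda.stream(stream_b):
        y_b = model_b(x_b)          # enqueued on stream_b, may overlap

torch.cuda.synchronize()            # wait for both streams to finish
```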
The Triton backend for the ONNX Runtime.
Miscellaneous codes and writings for MLOps
Streamlit Dockerized Computer Vision App with Triton Inference Server and PostgreSQL database
ClearML - Model-Serving Orchestration and Repository Solution
A proof-of-concept implementation of industrial computer vision systems. The project explores scalability and performance within the NVIDIA ecosystem, aiming to provide an example scaffold for building a system accessible to non-technical users.
OpenAI-compatible API for the TensorRT-LLM Triton backend
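Because the API is OpenAI-compatible, a standard OpenAI client should be able to talk to the endpoint; the base URL, API key, and model name in this sketch are placeholder assumptions to adjust for the actual deployment.

```python
# Sketch of calling an OpenAI-compatible endpoint in front of the
# TensorRT-LLM Triton backend. base_url, api_key, and the model name
# are placeholders, not values from any specific repository.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")

completion = client.chat.completions.create(
    model="ensemble",  # hypothetical model name exposed by the server
    messages=[{"role": "user", "content": "Summarize Triton in one line."}],
    max_tokens=64,
)
print(completion.choices[0].message.content)
```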
Deploy DL/ML inference pipelines with minimal extra code.
A DeepStream/Triton Server sample application that uses YOLOv7, YOLOv7-QAT, and YOLOv9 models to perform inference on video files or RTSP streams.
Provides an ensemble model to deploy a YOLOv8 TensorRT model to Triton
MLModelService wrapping Nvidia's Triton Server
A simple classification example that explains how Triton works.
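For reference, a minimal classification request with the Triton HTTP client might look like the sketch below; the model name and tensor names ("classifier", "INPUT", "OUTPUT") are hypothetical and must match the deployed model's config.pbtxt.

```python
# Minimal sketch of a classification request with the Triton HTTP client.
# Model name and tensor names are placeholders.
import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

batch = np.random.rand(1, 3, 224, 224).astype(np.float32)
inp = httpclient.InferInput("INPUT", list(batch.shape), "FP32")
inp.set_data_from_numpy(batch)

result = client.infer(model_name="classifier", inputs=[inp])
scores = result.as_numpy("OUTPUT")
print("predicted class:", int(scores.argmax()))
```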
C++ application to perform computer vision tasks using Nvidia Triton Server for model inference
An image retrieval system that uses a deep ResNet for feature extraction, Locally Optimized Product Quantization for storage and retrieval, and NVIDIA technologies such as TensorRT and Triton Server for efficient deployment, all accessible through a FastAPI-powered web API.