# triton-inference-server

Here are 73 public repositories matching this topic...

k9ele7en / Triton-TensorRT-Inference-CRAFT-pytorch

Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT text detection (PyTorch). Includes a converter from PyTorch -> ONNX -> TensorRT and inference pipelines (TensorRT, Triton server - multi-format). Supported model formats for Triton inference: TensorRT engine, TorchScript, ONNX.

inference pytorch text-detection nvidia-docker inference-server tensorrt inference-engine onnx onnx-torch tensorrt-conversion triton-inference-server text-detection-from-image

Updated Aug 18, 2021
Python

dudeperf3ct / end-to-end-images

This repo contains code for training and deploying PyTorch models for image applications in an end-to-end fashion.

pytorch image-classification fastapi triton-inference-server

Updated Nov 20, 2021
Jupyter Notebook

yas-sim / openvino-model-server-wrapper

Python wrapper class for OpenVINO Model Server. Users can submit inference requests to OVMS with just a few lines of code.

python cloud ai deep-learning grpc intel inference edge object-tracking tensorflow-serving grpc-client model-serving serving openvino line-crossing-detection area-intrusion-detection triton-inference-server openvino-docker openvino-model-server

Updated Jan 16, 2022
Python

tonhathuy / tensorrt-triton-magface

Magface Triton Inference Server using TensorRT

face-recognition onnx triton-inference-server magface tensorrt-engine

Updated Feb 12, 2022
Jupyter Notebook

LeslieZhoa / Triton-Torch-Custom

Triton-PyTorch custom operator tutorial

pytorch triton-inference-server custom-operators

Updated Mar 21, 2022
Python

Bobo-y / triton_ensemble_model_demo

Triton server ensemble model demo

pipeline triton-inference-server

Updated May 2, 2022
Python

isarsoft / yolov4-triton-tensorrt

This repository deploys YOLOv4 as an optimized TensorRT engine to Triton Inference Server

docker deep-learning object-detection tensorrt yolov4 triton-inference-server yolov4-tiny

Updated Jun 2, 2022
C++

octoml / ariel

A library for interfacing with Triton.

python machine-learning triton triton-inference-server

Updated Jun 8, 2022
Python

dpressel / reserve

FastAPI + WebSockets + SSE service to interface with Triton/Riva ASR

sse socketio asr riva fastapi triton-inference-server

Updated Jul 14, 2022
Python

niyazed / triton-mnist-example

MNIST inference example on NVIDIA Triton Inference Server

python docker machine-learning deep-learning tensorflow grpc inference pytorch nvidia-docker model-deployment mnist-handwriting-recognition triton-inference-server

Updated Sep 3, 2022
PureBasic

swapkh91 / detectron2-to-tensorrt

Notebook with commands to convert a Detectron2 MaskRCNN model to TensorRT

object-detection tensorrt detectron2 triton-inference-server

Updated Oct 12, 2022
Jupyter Notebook

oneonlee / Building-Transformer-Based-NLP-Applications

NVIDIA DLI "Building Transformer-Based Natural Language Processing Applications" workshop repository

nlp machine-learning text-classification nvidia transformer nemo ner bert word-embedding self-supervision triton-inference-server

Updated Oct 18, 2022
Jupyter Notebook

lastsign / Task_STT_Bot

Microservices with HTTP, Triton Inference Server, FastAPI, and Docker Compose

docker deep-learning docker-compose stt speach-to-text fastapi triton-inference-server

Updated Oct 23, 2022
Python

Alek-dr / FastAPI-TrironServer-example

machine-learning fastapi triton-inference-server

Updated Nov 10, 2022
Python

eitansela / sagemaker-mme-gpu-triton-java-client

Run Multiple Models on the Same GPU with Amazon SageMaker Multi-Model Endpoints Powered by NVIDIA Triton Inference Server. A Java client is also provided.

python java multi sagemaker-deployment sagemaker-example triton-inference-server

Updated Nov 14, 2022
Java

detail-novelist / novelist-triton-server

Deploy KoGPT with Triton Inference Server

transformers triton huggingface triton-inference-server kogpt gptj large-language-models fastertransformer

Updated Nov 18, 2022
Shell

tuxedocat / triton-client-polyglot-example

Example of generating triton-inference-server clients for some programming languages

typescript grpc-node triton-inference-server

Updated Dec 1, 2022
TypeScript

hiennguyen9874 / triton-face-recognition

Triton face detection & recognition

deep-learning face-recognition face-detection triton-inference-server

Updated Jan 3, 2023
Jupyter Notebook

CoinCheung / BiSeNet

Add bisenetv2. My implementation of BiSeNet

pytorch cityscapes tensorrt ncnn ade20k cocostuff openvino bisenet triton-inference-server

Updated Feb 5, 2023
Python

mirekphd / tritonserver-tritonclient-starter-xgb

A complete containerized setup for Triton Inference Server and its Python client using a realistic pre-trained XGBoost classifier model.

xgboost triton-inference-server tritonclient

Updated Feb 14, 2023
Python
