Issues: triton-inference-server/onnxruntime_backend
#249  Failed to allocated memory for requested buffer of size X (opened Mar 21, 2024 by aaditya-srivathsan)
#245  CPU Throttling when Deploying Triton with ONNX Backend on Kubernetes (opened Mar 1, 2024 by langong347)
#241  Enable "trt_build_heuristics_enable" optimization for onnxruntime-TensorRT (opened Feb 23, 2024 by tobaiMS)
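For context on #241: ONNX Runtime's TensorRT execution provider options reach this backend through the model's `config.pbtxt`. The sketch below is hypothetical, assuming the flag were wired through alongside the TensorRT accelerator parameters the backend already documents (such as `precision_mode` and `max_workspace_size_bytes`); `trt_build_heuristics_enable` is the ORT provider option the issue asks to expose and is not a supported key at the time of the issue.

```
# config.pbtxt fragment -- "trt_build_heuristics_enable" is hypothetical here;
# it is the ONNX Runtime TensorRT EP option that issue #241 requests.
optimization {
  execution_accelerators {
    gpu_execution_accelerator : [ {
      name : "tensorrt"
      parameters { key: "precision_mode" value: "FP16" }
      parameters { key: "max_workspace_size_bytes" value: "1073741824" }
      parameters { key: "trt_build_heuristics_enable" value: "True" }
    } ]
  }
}
```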
#210  Error while Loading YOLOv8 Model with EfficientNMS_TRT Plugin in TRITON (opened Aug 30, 2023 by whitewalker11)
#203  Onnxruntime backend error when workload is high since Triton uses CUDA 12 [bug] (opened Jul 8, 2023 by zeruniverse)
#194  Add enable_dynamic_shapes To Model Config To Resolve CNN Memory Leaks With OpenVino EP (opened Jun 2, 2023 by narolski)
#191  InvalidArgumentError: The tensor Input (Input) of Slice op is not initialized. (opened May 25, 2023 by qiu-pinggaizi)
#185  Fatal error: TRT:EfficientNMS_TRT(-1) is not a registered function/op (opened May 4, 2023 by levipereira)
#175  Can I build the Onnxruntime backend for Windows without Docker? (opened Mar 15, 2023 by victorsoyvictor)
#166  Expose session.use_device_allocator_for_initializers in onnxruntime_backend to completely shrink arena (opened Jan 16, 2023 by zeruniverse)
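For context on #166: the backend already forwards a set of session-level knobs (for example `enable_mem_arena`) through top-level `parameters` entries in `config.pbtxt`. A hypothetical sketch of what the requested exposure might look like, following that existing pattern; the `session.use_device_allocator_for_initializers` key is the ONNX Runtime session config entry the issue asks for and is not supported by the backend at the time of the issue.

```
# config.pbtxt fragment -- the second key is hypothetical; it mirrors the
# ORT session config entry that issue #166 requests the backend expose.
parameters { key: "enable_mem_arena" value: { string_value: "1" } }
parameters { key: "session.use_device_allocator_for_initializers" value: { string_value: "1" } }
```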
#165  Possible to enable dynamic batch dimension only on some input tensors? (opened Dec 30, 2022 by kgu3)