
✨[Feature] Autogen TRT Plugins #2817

Open
narendasan (Collaborator) opened this issue May 6, 2024 · 0 comments

Labels: feature request (New feature or request)
Is your feature request related to a problem? Please describe.

There are cases where taking the extra step of wrapping a Torch layer or PyTorch custom op in a plugin embedded in a TRT engine may improve the performance of the model. However, a ton of boilerplate is needed to actually access the operator through TensorRT. It would be great if this could be abstracted away for users.

Describe the solution you'd like

Given a functional Torch operator and a FakeTensor implementation, autogenerate the TensorRT plugin code needed to embed that op in a TRT engine.

Describe alternatives you've considered

This could be done in C++ as well, but that would likely be more complicated than handling it in Python.

Additional context

https://github.com/pytorch/TensorRT/blob/main/examples/dynamo/custom_kernel_plugins.py

@narendasan narendasan added the feature request New feature or request label May 6, 2024
@narendasan narendasan self-assigned this May 6, 2024