[Build] Propagate build option for CUDA minimal to TRT #20695
base: main
Conversation
Can you rebase it to main?
Force-pushed from 6566d23 to c6d7bb0
@chilo-ms Sure, done.
/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, Linux QNN CI Pipeline
/azp run Windows CPU CI Pipeline, Windows GPU CI Pipeline, Windows GPU TensorRT CI Pipeline, Windows ARM64 QNN CI Pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed, ONNX Runtime React Native CI Pipeline, Windows x64 QNN CI Pipeline
/azp run Linux MIGraphX CI Pipeline, orttraining-amd-gpu-ci-pipeline
Azure Pipelines successfully started running 2 pipeline(s).
Azure Pipelines successfully started running 9 pipeline(s).
file(GLOB_RECURSE onnxruntime_providers_cuda_cu_srcs CONFIGURE_DEPENDS
  "${ONNXRUNTIME_ROOT}/core/providers/cuda/*.cu"
  "${ONNXRUNTIME_ROOT}/core/providers/cuda/*.cuh"
)
else()
  set(onnxruntime_providers_cuda_cu_srcs
    "${ONNXRUNTIME_ROOT}/core/providers/cuda/math/unary_elementwise_ops_impl.cu"
It seems we need to include unary_elementwise_impl.cuh as well? The definition of UnaryElementWiseImpl() is in that file and is used by cuda::Impl_Cast<SrcT, DstT>, which the TRT EP calls to cast DOUBLE <-> FLOAT or INT64 <-> INT32.
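For illustration, explicitly listing the header next to its .cu file in the minimal source set would look roughly like the sketch below. The .cuh filename is an assumption modeled on the .cu path in the snippet above; whether headers should appear in the source list at all is exactly what this thread goes on to discuss.

```cmake
# Sketch only: headers are usually resolved via include directories rather
# than listed as sources, but adding the .cuh explicitly would look like this.
set(onnxruntime_providers_cuda_cu_srcs
  "${ONNXRUNTIME_ROOT}/core/providers/cuda/math/unary_elementwise_ops_impl.cu"
  "${ONNXRUNTIME_ROOT}/core/providers/cuda/math/unary_elementwise_ops_impl.cuh"
)
```

Listing a header as a source is harmless in CMake (it mainly aids IDE project generation), which is why the thread concludes it is unnecessary here.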
The header will be picked up since the include directory is set correctly. If you want, I can add it to the sources, but I would say headers are usually not listed as sources in CMake.
Yeah, true; then there's no need to add it to the sources.
Description
Extend the CUDA minimal build option to the TRT provider, since with TRT 10 linking against cuDNN is no longer required.
Besides that, with the new engine dump feature it is also possible to embed an engine into an ONNX model and avoid shipping a builder lib.
In addition, this has roughly the same deserialization/session setup time as using TRT standalone.
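As a sketch of what "propagating" the option means on the CMake side: the TRT provider's cuDNN linkage would be gated on the same CUDA-minimal switch that already trims the CUDA EP. All names below (the option, target, and cuDNN library variable) are assumptions for illustration, not taken from this PR's actual diff.

```cmake
# Hypothetical: skip cuDNN when the CUDA-minimal option is enabled, mirroring
# the pattern the CUDA EP uses. Option/target names are illustrative only.
if(NOT onnxruntime_CUDA_MINIMAL)
  target_link_libraries(onnxruntime_providers_tensorrt PRIVATE ${CUDNN_LIBRARIES})
endif()
```

With TRT 10 dropping its cuDNN dependency, such a guard lets a TRT-enabled minimal build avoid pulling cuDNN in at all.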
Motivation and Context