Update cudnn convolution kernel #10440

linzs148 · 2024-03-06T07:23:06Z

No description provided.

github-actions · 2024-03-06T07:24:58Z

Code got formatted by CI. Please request CI again if you still want to have this PR merged. If the PR is from a forked repo, please download the patch files from the GitHub Actions web page and apply them locally.

cmake/third_party/cudnn-frontend.cmake

oneflow/core/device/cudnn_util.h

mosout · 2024-04-25T02:23:44Z

oneflow/user/kernels/conv_cudnn_kernels.cpp

+  void Compute(user_op::KernelComputeContext* ctx, user_op::OpKernelState*,
+               const user_op::OpKernelCache* cache) const override {
+    // process context data
+    auto input = ctx->Tensor4ArgNameAndIndex("in", 0);


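Immutable objects such as `in` should be declared with `const auto*`, and mutable objects such as `tmp_buffer` with `auto*`. The same applies to the similar places below.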
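A minimal sketch of the convention requested above. `DemoTensor`, `DemoContext`, `Tensor4Name`, and `CopyAndSum` are hypothetical stand-ins written for illustration only; they are not OneFlow's `user_op::Tensor` or `KernelComputeContext` API. The accessor returns a raw pointer, so plain `auto` would deduce a mutable pointer even for a tensor that is only read.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical tensor type (assumption, not OneFlow's user_op::Tensor).
struct DemoTensor {
  std::vector<float> data;
  const float* dptr() const { return data.data(); }  // read-only access
  float* mut_dptr() { return data.data(); }          // mutable access
  std::size_t size() const { return data.size(); }
};

// Hypothetical context that hands out raw tensor pointers by name.
struct DemoContext {
  DemoTensor in;
  DemoTensor tmp_buffer;
  DemoTensor* Tensor4Name(const std::string& name) {
    return name == "in" ? &in : &tmp_buffer;
  }
};

// Copies the input into the scratch buffer and returns the sum of the input.
// Assumes tmp_buffer holds at least as many elements as in.
float CopyAndSum(DemoContext* ctx) {
  const auto* input = ctx->Tensor4Name("in");          // read-only: pointee is const
  auto* tmp_buffer = ctx->Tensor4Name("tmp_buffer");   // written to: pointee stays mutable
  float sum = 0.0f;
  for (std::size_t i = 0; i < input->size(); ++i) {
    tmp_buffer->mut_dptr()[i] = input->dptr()[i];
    sum += input->dptr()[i];
  }
  // input->mut_dptr();  // would not compile: mut_dptr() is a non-const member
  return sum;
}
```

With `const auto*`, an accidental write through `input` becomes a compile error instead of a silent bug.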

mosout · 2024-04-25T02:26:24Z

oneflow/user/kernels/conv_cudnn_kernels.cpp

+      .SetIsMatchedHob(user_op::HobDeviceType() == DeviceType::kCUDA                               \
+                       && user_op::HobEnvBool("ONEFLOW_KERNEL_ENABLE_CUDNN_V8", false))            \
+      .SetInferTmpSizeFn([](user_op::InferContext* ctx) -> size_t {                                \
+        auto& input = ctx->InputTensorDesc("in", 0);                                               \


Use `const` here; the same applies to the cases below.
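A small sketch of what the explicit `const` buys for a reference binding. `DemoTensorDesc`, `DemoInferContext`, and `InferTmpSize` are hypothetical names for illustration and do not reproduce OneFlow's `InferContext`. The sketch assumes the accessor returns a const reference; in that case `auto&` already deduces a const reference, and writing `const auto&` mainly documents the intent and keeps the binding read-only if the accessor's return type ever changes.

```cpp
#include <cassert>
#include <cstddef>
#include <type_traits>

// Hypothetical descriptor type (assumption, not OneFlow's TensorDesc).
struct DemoTensorDesc {
  std::size_t elem_cnt;
};

// Hypothetical infer context whose accessor returns a const reference.
struct DemoInferContext {
  DemoTensorDesc in_desc;
  const DemoTensorDesc& InputTensorDesc() const { return in_desc; }
};

// Returns a scratch size in bytes, assuming one float per input element.
std::size_t InferTmpSize(const DemoInferContext& ctx) {
  auto& implicit_desc = ctx.InputTensorDesc();        // const is deduced silently
  const auto& explicit_desc = ctx.InputTensorDesc();  // const is visible at the call site
  static_assert(std::is_same<decltype(implicit_desc), const DemoTensorDesc&>::value,
                "auto& deduces const from a const-reference return type");
  static_assert(std::is_same<decltype(explicit_desc), const DemoTensorDesc&>::value,
                "const auto& yields the same type, stated explicitly");
  (void)implicit_desc;  // only used for the type check above
  return explicit_desc.elem_cnt * sizeof(float);
}
```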

oneflow/core/device/cudnn_conv_util.h

mosout · 2024-04-25T02:36:58Z

oneflow/user/kernels/conv_cudnn_kernels.cpp

+
+ private:
+  void Compute(user_op::KernelComputeContext* ctx) const override {
+    auto input = ctx->Tensor4ArgNameAndIndex("x", 0);


mosout · 2024-04-25T02:37:08Z

oneflow/user/kernels/conv_cudnn_kernels.cpp

+    .SetIsMatchedHob(user_op::HobDeviceType() == DeviceType::kCUDA
+                     && user_op::HobEnvBool("ONEFLOW_KERNEL_ENABLE_CUDNN_V8", false))
+    .SetInferTmpSizeFn([](user_op::InferContext* ctx) -> size_t {
+      auto& input = ctx->InputTensorDesc("x", 0);


mosout · 2024-04-25T02:40:04Z

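There are also many other places that use `auto`; please go through them and add `const` wherever it applies. Parameters in function declarations that are not modified in the function body should also be taken as `const&`.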
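For the function-parameter part of this request, a minimal sketch follows. `ElemCount` is a hypothetical helper, not a function from this PR. A parameter that the body only reads is taken by `const&`, which avoids a copy and lets callers pass temporaries.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper: multiplies all dimensions of a shape.
// The shape is only read, so it is taken by const reference.
int64_t ElemCount(const std::vector<int64_t>& shape) {
  int64_t count = 1;
  for (const auto dim : shape) {  // loop variable is not modified either
    count *= dim;
  }
  return count;
}
```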

add cudnn-frontend dependency

35ff41c

linzs148 requested a review from jackalcooper as a code owner March 6, 2024 07:23

linzs148 requested a review from mosout March 6, 2024 07:23

linzs148 added the op label Mar 6, 2024

auto format by CI

dc8a96e

linzs148 added enhancement feature labels Mar 6, 2024

linzs148 changed the title ~~Update cudnn convolution kernel~~ Add cudnn-frontend dependency Mar 6, 2024

linzs148 added build system and removed enhancement feature op labels Mar 6, 2024

add cudnn v8 conv forward kernel

6e01476

linzs148 changed the title ~~Add cudnn-frontend dependency~~ Update cudnn convolution kernel Mar 17, 2024

linzs148 added enhancement feature op and removed build system labels Mar 17, 2024

linzs148 added 5 commits March 27, 2024 05:09

add cudnn v8 conv backward kernel

49e2c18

transform output type to fp32 for input type fp16

e19d08a

refine workspace assign for conv v8

cd38272

fix comment

19f46f2

change heuristic value

1422954

mosout reviewed Apr 22, 2024

cmake/third_party/cudnn-frontend.cmake (outdated)

jackalcooper reviewed Apr 22, 2024

cmake/third_party/cudnn-frontend.cmake (outdated)

move cmake files to external

ddeb907

mosout reviewed Apr 25, 2024

linzs148 added 4 commits April 30, 2024 09:06

fix cmake bug

18625d9

Merge branch 'master' into feat/cudnn_conv_v8

b25ad47

install cudnn_frontend

a8fd7a3

refine functions

bea5080
