Add cudnn conv2d #435
base: main
Conversation
Thanks @yudi0201 !
Overall looks good to me. After merging this PR, we can add a primitive function that calls conv2d_cudnn in our runtime library and expose an operator like hidet.ops.conv2d_cudnn.
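For reference, the semantics that such an operator would need to match can be sketched in plain Python. This is only an illustrative NCHW convolution (no padding or dilation, nested lists instead of tensors); the name conv2d_ref and its signature are hypothetical, not hidet's API:

```python
def conv2d_ref(x, w, stride=1):
    # x: input  [N][C][H][W] as nested lists
    # w: filter [K][C][R][S] as nested lists
    n, c, h, wd = len(x), len(x[0]), len(x[0][0]), len(x[0][0][0])
    k, _, r, s = len(w), len(w[0]), len(w[0][0]), len(w[0][0][0])
    oh = (h - r) // stride + 1   # output height, valid convolution
    ow = (wd - s) // stride + 1  # output width
    y = [[[[0.0] * ow for _ in range(oh)] for _ in range(k)] for _ in range(n)]
    for ni in range(n):
        for ki in range(k):
            for i in range(oh):
                for j in range(ow):
                    acc = 0.0
                    for ci in range(c):
                        for ri in range(r):
                            for si in range(s):
                                acc += (x[ni][ci][i * stride + ri][j * stride + si]
                                        * w[ki][ci][ri][si])
                    y[ni][ki][i][j] = acc
    return y
```

A cuDNN-backed operator would compute the same result, just on the GPU via the backend descriptors this PR sets up.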
src/hidet/runtime/cuda/cudnn.cpp
Outdated
void *dev_ptrs[3] = {ptr_x, ptr_w, ptr_y};  // device pointers
int64_t uids[3] = {'x', 'w', 'y'};
void *workspace = hidet_cuda_malloc_async(workspaceSize, cur_stream);
It might be better to use the workspace shared by all hidet operators (i.e., https://github.com/hidet-org/hidet/blob/main/include/hidet/runtime/cuda/context.h#L46).
When we run the operator a second time, there will be no memory allocation, so it can also be used with CUDA graphs.
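The idea behind the shared workspace can be illustrated with a minimal grow-only buffer in Python. This is only a sketch of the pattern, not hidet's actual implementation (which lives in the C++ runtime behind context.h); the class and method names here are hypothetical:

```python
class SharedWorkspace:
    """Grow-only scratch buffer shared across operators.

    The buffer only reallocates when a request exceeds the largest size
    seen so far; once workloads are warmed up, every call returns the
    cached buffer with no allocation, which is what makes the pattern
    compatible with CUDA graph capture (capture forbids allocations).
    """

    def __init__(self):
        self.size = 0
        self.buf = None

    def request(self, nbytes):
        if nbytes > self.size:
            # Grow to the largest request seen so far.
            # (bytearray stands in for a cudaMallocAsync'd device buffer.)
            self.size = nbytes
            self.buf = bytearray(nbytes)
        return self.buf
```

With this pattern, a second run of the same operator hits the cached buffer and performs no allocation, matching the behavior described above.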
CHECK_CUDNN(cudnnBackendDestroyDescriptor(xDesc));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(wDesc));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(yDesc));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(cDesc));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(fprop));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(op_graph));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(engine));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(engcfg));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(plan));
Might be good to benchmark the performance of our implementation vs. PyTorch's conv2d. I am not sure whether the overhead of creating/destroying the descriptors on every call is large enough to affect performance.
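A simple way to measure this kind of per-call overhead is a timing harness with warm-up iterations, so that one-time costs (descriptor creation, plan building, autotuning) are separated from steady-state cost. The sketch below is a generic helper, not part of hidet; the name bench is hypothetical:

```python
import time

def bench(fn, warmup=3, repeats=10):
    """Return the average wall-clock time of fn() in milliseconds.

    Warm-up runs absorb one-time setup costs before timing starts.
    (For GPU kernels, a real benchmark would also need to synchronize
    the device around the timed region.)
    """
    for _ in range(warmup):
        fn()
    start = time.perf_counter()
    for _ in range(repeats):
        fn()
    return (time.perf_counter() - start) / repeats * 1e3
```

Comparing the timings with warmup=0 against warmup>0 would show how much of the cost comes from descriptor creation/destruction versus the convolution itself.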
@yaoyaoding
It's doable, similar to cublas gemm: 072a606
What about adding cudnn*, cublas* etc to the search space?
That's exactly what the commit I mentioned above does.
No description provided.