Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

generate-dependencies-with-compile in RTX3060 Cuda11.1 #195

Open
v-nhandt21 opened this issue Dec 3, 2021 · 1 comment
Open

generate-dependencies-with-compile in RTX3060 Cuda11.1 #195

v-nhandt21 opened this issue Dec 3, 2021 · 1 comment

Comments

@v-nhandt21
Copy link

image

I wonder the cuda implement of SRU work in CUDA11, I found in the forum that to solve the problem, we must use cuda10.2 https://discuss.pytorch.org/t/just-in-time-loading-and-compiling-cuda-kernels-was-unsuccesful/124486

Here is my computer info
image

Does the speed of SRU decrease if we just use operation of pytorch, not the opt write in c++, is there that version?

If there are someone solve problem run in 3060, can you share!

@taolei87
Copy link
Contributor

taolei87 commented Dec 7, 2021

hi @v-nhandt21 , the compilation arguments such as "--generate-dependencies-with-compile" are automatically added by ninja / nvcc.

Looking at your first screenshot, ninja/nvcc attempts to build the code using --sm_75 and --compute_75 architecture code. However, the correct code should be sm_86 for your RTX 3060, according to this article:
https://arnon.dk/matching-sm-architectures-arch-and-gencode-for-various-nvidia-cards/#ampere-cuda-11-1-and-later

Maybe you can google search or ask in the Pytorch forum how to fix this arch code issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants