Wrong compute architecture is being detected during build #23

Open
wallentx opened this issue Oct 26, 2023 · 0 comments

Comments

@wallentx

Hello!
I noticed the following during build:

./build.sh
...

-- The CXX compiler identification is GNU 13.2.1
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Check for working CXX compiler: /usr/bin/c++ - skipped
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Looking for a CUDA compiler
-- Looking for a CUDA compiler - /opt/cuda/bin/nvcc
-- The CUDA compiler identification is NVIDIA 12.2.140
-- Detecting CUDA compiler ABI info
-- Detecting CUDA compiler ABI info - done
-- Check for working CUDA compiler: /opt/cuda/bin/nvcc - skipped
-- Detecting CUDA compile features
-- Detecting CUDA compile features - done
-- ----------------------
-- peakperf build report:
-- CPU mode: ON
-- GPU mode: ON
-- ----------------------
-- Configuring done (7.2s)
-- Generating done (0.0s)
-- Build files have been written to: /home/william/src/peakperf/build
[  5%] Building CXX object CMakeFiles/cpu_device.dir/src/cpu/cpufetch/cpufetch.cpp.o
[  5%] Building CXX object CMakeFiles/512_8.dir/src/cpu/arch/512_8.cpp.o
[ 11%] Building CXX object CMakeFiles/cpu_device.dir/src/cpu/cpufetch/cpuid.cpp.o
[ 11%] Building CXX object CMakeFiles/512_12.dir/src/cpu/arch/512_12.cpp.o
[ 14%] Building CUDA object CMakeFiles/gpu_device.dir/src/gpu/arch.cu.o
[ 20%] Building CXX object CMakeFiles/cpu_device.dir/src/cpu/cpufetch/uarch.cpp.o
[ 23%] Building CXX object CMakeFiles/256_5.dir/src/cpu/arch/256_5.cpp.o
[ 29%] Building CXX object CMakeFiles/cpu_device.dir/src/cpu/arch/arch.cpp.o
[ 29%] Building CXX object CMakeFiles/256_6.dir/src/cpu/arch/256_6.cpp.o
[ 35%] Building CXX object CMakeFiles/256_6_nofma.dir/src/cpu/arch/256_6_nofma.cpp.o
[ 35%] Building CUDA object CMakeFiles/gpu_device.dir/src/gpu/kernel.cu.o
[ 35%] Building CXX object CMakeFiles/cpu_device.dir/src/cpu/arch/arch_sse.cpp.o
[ 38%] Building CXX object CMakeFiles/128_6.dir/src/cpu/arch/128_6.cpp.o
[ 44%] Building CXX object CMakeFiles/cpu_device.dir/src/cpu/arch/arch_avx512.cpp.o
[ 47%] Building CXX object CMakeFiles/cpu_device.dir/src/cpu/arch/arch_avx.cpp.o
[ 47%] Building CXX object CMakeFiles/256_8.dir/src/cpu/arch/256_8.cpp.o
[ 52%] Building CXX object CMakeFiles/128_8.dir/src/cpu/arch/128_8.cpp.o
[ 52%] Building CXX object CMakeFiles/256_10.dir/src/cpu/arch/256_10.cpp.o
nvcc fatal   : Unsupported gpu architecture 'compute_35'
make[2]: *** [CMakeFiles/gpu_device.dir/build.make:76: CMakeFiles/gpu_device.dir/src/gpu/arch.cu.o] Error 1
make[2]: *** Waiting for unfinished jobs....
nvcc fatal   : Unsupported gpu architecture 'compute_35'
make[2]: *** [CMakeFiles/gpu_device.dir/build.make:90: CMakeFiles/gpu_device.dir/src/gpu/kernel.cu.o] Error 1
make[1]: *** [CMakeFiles/Makefile2:401: CMakeFiles/gpu_device.dir/all] Error 2
make[1]: *** Waiting for unfinished jobs....
[ 55%] Linking CXX static library lib512_12.a
[ 58%] Linking CXX static library lib512_8.a
[ 58%] Built target 512_12
[ 58%] Built target 512_8
[ 61%] Linking CXX static library lib256_5.a
[ 64%] Linking CXX static library lib256_8.a
[ 67%] Linking CXX static library lib256_10.a
[ 70%] Linking CXX static library lib256_6.a
[ 73%] Linking CXX static library lib128_8.a
[ 76%] Linking CXX static library lib128_6.a
[ 76%] Built target 256_5
[ 79%] Built target 256_8
[ 79%] Built target 256_10
[ 79%] Linking CXX static library lib256_6_nofma.a
[ 79%] Built target 256_6
[ 79%] Built target 128_8
[ 82%] Linking CXX static library libcpu_device.a
[ 82%] Built target 128_6
[ 82%] Built target 256_6_nofma
[ 82%] Built target cpu_device
make: *** [Makefile:136: all] Error 2

The relevant part is: nvcc fatal : Unsupported gpu architecture 'compute_35'
I looked at the code briefly, but couldn't see anything obvious that would cause compute_35 to be selected as the target architecture.
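One way to narrow this down is to ask the installed nvcc which virtual architectures it accepts and check whether compute_35 is among them. As far as I know, CUDA 12 toolkits dropped Kepler targets such as compute_35, which would match the error above. The snippet below is only a rough sketch: it assumes nvcc is on PATH and supports the --list-gpu-arch option, and the helper names are made up for illustration, not taken from peakperf.

```python
import shutil
import subprocess


def parse_arch_list(output: str) -> set[str]:
    """Collect the 'compute_XX' names from `nvcc --list-gpu-arch` style output."""
    return {tok for tok in output.split() if tok.startswith("compute_")}


def nvcc_supported_archs(nvcc: str = "nvcc") -> set[str]:
    """Return the virtual architectures nvcc reports.

    Returns an empty set if nvcc is not found or the option is rejected
    (assumption: older toolkits may not know --list-gpu-arch).
    """
    if shutil.which(nvcc) is None:
        return set()
    result = subprocess.run(
        [nvcc, "--list-gpu-arch"],
        capture_output=True,
        text=True,
        check=False,
    )
    if result.returncode != 0:
        return set()
    return parse_arch_list(result.stdout)


if __name__ == "__main__":
    archs = nvcc_supported_archs()
    if not archs:
        print("could not query nvcc for supported architectures")
    else:
        print("compute_35 supported:", "compute_35" in archs)
```

If compute_35 is missing from that list, the failure comes from the architecture passed to nvcc by the build rather than from the GPU itself.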
[Screenshot: Screenshot_20231026-030647]

The getGencode script is just a validation tool I cobbled together for another project: https://github.com/wallentx/alpha-report/blob/90ee2e7c006dcfd75dd76fe31ffc0a866179d819/get-gencode#L1-L33
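For anyone who wants to cross-check without that script, here is a minimal sketch of the same idea. It is not the linked get-gencode script and not peakperf's detection logic. It assumes a driver recent enough that nvidia-smi supports the compute_cap query field, and the function names are hypothetical.

```python
import re
import shutil
import subprocess


def gencode_flag(compute_cap: str) -> str:
    """Turn a compute capability such as '8.6' into an nvcc -gencode flag."""
    major, minor = compute_cap.strip().split(".")
    arch = f"{int(major)}{int(minor)}"
    return f"-gencode arch=compute_{arch},code=sm_{arch}"


def detect_compute_caps() -> list[str]:
    """Ask nvidia-smi for each GPU's compute capability.

    Returns an empty list if nvidia-smi is missing or the query fails
    (assumption: older drivers may not support the compute_cap field).
    """
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=compute_cap", "--format=csv,noheader"],
        capture_output=True,
        text=True,
        check=False,
    )
    if result.returncode != 0:
        return []
    # Keep only lines that look like "major.minor"; skip anything else.
    return [
        line.strip()
        for line in result.stdout.splitlines()
        if re.fullmatch(r"\d+\.\d+", line.strip())
    ]


if __name__ == "__main__":
    for cap in detect_compute_caps():
        print(cap, "->", gencode_flag(cap))
```

Comparing the flag this prints with the architecture in the nvcc command line from the failing build should show whether the build is targeting something other than the installed GPU.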
