Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to use GPA tool to get advice #5

Open
HarsonLau opened this issue Aug 2, 2022 · 3 comments
Open

Unable to use GPA tool to get advice #5

HarsonLau opened this issue Aug 2, 2022 · 3 comments

Comments

@HarsonLau
Copy link

I completed the installation of this project through your installation script. My GPU is T4 and I am using cuda toolkit 11.6.
When I followed the tutorial in install.md, using GPA to get advice, I get no output in the directory gpa-database/

root@n37-139-082:~# cd GPA/
root@n37-139-082:~/GPA# cd ./GPA-Benchmark/ExaTENSOR/exatensor-opt1
root@n37-139-082:~/GPA/GPA-Benchmark/ExaTENSOR/exatensor-opt1# make 
make: 'all' is up to date.
root@n37-139-082:~/GPA/GPA-Benchmark/ExaTENSOR/exatensor-opt1# make clean 
rm -rf main *.o *.dot *.hpcstruct *.cubin *.qdrep *.sqlite
root@n37-139-082:~/GPA/GPA-Benchmark/ExaTENSOR/exatensor-opt1# make 
nvcc -o main main.cu -DCUDA3 -Xcompiler "-g -fopenmp" -O3 -lineinfo  -lcudart -lcuda -lstdc++ -lm
root@n37-139-082:~/GPA/GPA-Benchmark/ExaTENSOR/exatensor-opt1# gpa -v ./main
Make sure gpa-measurements and gpa-database is clean
Profiling: collect pc sampling performance metrics
Parsing: parse CPU and GPU binaries
Analyzing: match metrics with advice
Output advice in gpa-database/gpa.advice
Done...
root@n37-139-082:~/GPA/GPA-Benchmark/ExaTENSOR/exatensor-opt1# ls gpa-database/
root@n37-139-082:~/GPA/GPA-Benchmark/ExaTENSOR/exatensor-opt1# 

This is the content of gpa.log

root@n37-139-082:~/GPA/GPA-Benchmark/ExaTENSOR/exatensor-opt1# cat gpa.log 
NOTE: Using builtin path for NVIDIA's CUPTI tools library /usr/local/cuda/lib64/libcupti.so.
Elapsed time 0.013346
msg: begin serial analysis of 8a99426adfeaf92557e0b6842027decb.cubin
WARNING: incomplete analysis of 8a99426adfeaf92557e0b6842027decb.cubin; see /root/GPA/GPA-Benchmark/ExaTENSOR/exatensor-opt1/gpa-measurements/structs/8a99426adfeaf92557e0b6842027decb.cubin.warnings for details
msg: end serial analysis of 8a99426adfeaf92557e0b6842027decb.cubin
HPCStructure fatal error: processing Document:STRUCTURE file '/root/GPA/GPA-Benchmark/ExaTENSOR/exatensor-opt1/gpa-measurements/structs/8a99426adfeaf92557e0b6842027decb.cubin.hpcstruct' at line 69, character 1:
        XML parser: invalid document structure.
root@n37-139-082:~/GPA/GPA-Benchmark/ExaTENSOR/exatensor-opt1# 

The warning mentioned above shows that a segfault has occurred, but I don't know exactly what is causing it

root@n37-139-082:~/GPA/GPA-Benchmark/ExaTENSOR/exatensor-opt1# cat gpa-measurements/structs/8a99426adfeaf92557e0b6842027decb.cubin.warnings 
Segmentation fault (core dumped)
@Jokeren
Copy link
Owner

Jokeren commented Aug 2, 2022

Can you please try cuda 11.1.?

@HarsonLau
Copy link
Author

Can you please try cuda 11.1.?

Unfortunately, even with cuda 11.1.0 it still doesn't work.
YTe3igcckF
This is the error message recorded in gpa.log after adding the -v option. As you can see, I am running GPA as root, and this permission-related error is very confusing

@Jokeren
Copy link
Owner

Jokeren commented Aug 3, 2022

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants