Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

When running DPA-1 with 'dp train train-2.json', it gets stuck at the 'OMP' step. #3690

Open
wangyi01 opened this issue Apr 19, 2024 · 1 comment
Labels

Comments

@wangyi01
Copy link

wangyi01 commented Apr 19, 2024

Summary

When using the 4090 card for operation, it will get stuck at:
OMP: Info #254: IMP AFFINTTY: pid 27858 tid 27975 thread 11 bound to OS proc set 5
OMP: Info #254: KMP AFFINITY: pid 27858 tid 27976 thread 19 bound to OS proc set 0

DeePMD-kit Version

deepmd-kit:2.2.1-cuda11.6

Backend and its version

It didn't run until printing the DeePMD-kit logo.

Details

There is no further information afterwards, and when checking with 'nvidia-smi', it is found that it did not start running.

@njzjz
Copy link
Member

njzjz commented May 22, 2024

Will it still happen when you limit OMP threads?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants