tanh doesn't use all cores #2136
Comments
What happens when you fiddle around with the pertinent environment variables (some subset of |
I think this should really be handled automatically? At least, I don't see anything in any of the docs I've read suggesting one needs to do this. So either the docs need to be updated, or this should be handled automatically, I reckon. |
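For anyone who wants to try the environment-variable suggestion, here is a small sketch that reports the current thread-related settings. The list of variables in the comment above is cut off, so `OMP_NUM_THREADS` and `MKL_NUM_THREADS` are an assumption on my part (they are the standard OpenMP and Intel MKL thread-count variables); `thread_env_report` is just an illustrative helper name.

```python
import os

# Assumed set of variables; the ones the comment above had in mind are not known.
THREAD_ENV_VARS = ("OMP_NUM_THREADS", "MKL_NUM_THREADS")


def thread_env_report(env=None):
    """Return the thread-related environment variables plus the logical CPU count."""
    env = os.environ if env is None else env
    # env.get returns None for variables that are not set.
    report = {name: env.get(name) for name in THREAD_ENV_VARS}
    report["cpu_count"] = os.cpu_count()
    return report


if __name__ == "__main__":
    for key, value in thread_env_report().items():
        print(f"{key} = {value if value is not None else '(unset)'}")
```

These variables are normally read when the threading runtime starts, so they would need to be set before launching Python, for example `OMP_NUM_THREADS=8 python script.py`. As the next comments point out, this may not change anything for tanh if that code path is not parallelised at all.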
(tanh shouldn't be using BLAS? I would think it uses some combination of SSE and OpenMP?) |
Oh of course, sorry. Yeah, this is a good question then! |
It seems that Tanh doesn't use OpenMP at all. It calls into TH, and it uses We could provide some guidance if anyone wants to take a stab. We already have macros that extend |
I submitted a PR, check if it looks like what's expected. @apaszke |
If one wants to use OMP on |
Closing since this has been fixed via #2792 |
When I run this script:
and then open htop, I expect to see all 8 cores running at 100%, but only 4 seem to be running. In addition, the cores that are running are only at ~30-40%.
(Note that I'm not submitting a fix for this, just flagging it)
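One rough way to check whether tanh scales with the thread count is to time it under different `torch.set_num_threads` settings. This is only a sketch, not the script from the report: the tensor size, the thread counts, and the helper names `mean_seconds` and `tanh_scaling` are all assumptions chosen for illustration.

```python
import time


def mean_seconds(fn, repeats=5):
    """Average wall-clock seconds per call of fn, after one warm-up call."""
    fn()  # warm-up, so one-time setup cost is not measured
    start = time.perf_counter()
    for _ in range(repeats):
        fn()
    return (time.perf_counter() - start) / repeats


def tanh_scaling(thread_counts=(1, 2, 4, 8), n=10_000_000):
    """Time torch.tanh on an n-element tensor for each thread count."""
    import torch  # imported lazily so mean_seconds works without PyTorch

    x = torch.randn(n)
    results = {}
    for threads in thread_counts:
        torch.set_num_threads(threads)
        results[threads] = mean_seconds(lambda: torch.tanh(x))
    return results


if __name__ == "__main__":
    try:
        timings = tanh_scaling()
    except ImportError:
        print("PyTorch is not installed; nothing to measure.")
    else:
        for threads, secs in timings.items():
            print(f"{threads} threads: {secs * 1e3:.1f} ms per call")
```

If the time per call barely changes as the thread count goes up, that would be consistent with tanh not being parallelised, which is what htop seems to suggest.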