argmax calculation on the GPU? #367

Open
TimoSaemann opened this issue Jul 12, 2017 · 5 comments

Comments

@TimoSaemann

Hi,

I have found that the argmax calculation is a bottleneck. While the forward pass of my net takes just 18 ms, the argmax calculation takes about 40 ms (1024x544 px). I'm sure this could be sped up by computing it on the GPU. Would it be possible for you to implement an argmax.cu? That would be really helpful! Thanks!

Best,
Timo
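
To make the request concrete, here is a minimal C++ sketch of a per-pixel argmax over channels. It is not code from this project: the function names `argmax_pixel` and `argmax_channels` are hypothetical, and it assumes the scores are stored channel-major (C x H x W) with one channel per class, which the issue does not state. The point it illustrates is that each pixel's result depends only on that pixel's own channel values, so the outer loop could in principle be mapped to one GPU thread per pixel.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper, not part of the project: argmax over channels for one
// pixel. `scores` is assumed to be channel-major (C x H x W), so the values
// belonging to one pixel are `spatial` elements apart.
inline int argmax_pixel(const float* scores, int channels,
                        std::size_t spatial, std::size_t pixel) {
  int best = 0;
  float best_val = scores[pixel];
  for (int c = 1; c < channels; ++c) {
    const float v = scores[static_cast<std::size_t>(c) * spatial + pixel];
    if (v > best_val) {  // strict '>' keeps the lowest index on ties
      best_val = v;
      best = c;
    }
  }
  return best;
}

// Every pixel is independent of the others. In a GPU port this loop would be
// replaced by a grid of threads, each running the per-pixel routine once.
std::vector<int> argmax_channels(const std::vector<float>& scores,
                                 int channels, int height, int width) {
  const std::size_t spatial = static_cast<std::size_t>(height) * width;
  std::vector<int> labels(spatial);
  for (std::size_t p = 0; p < spatial; ++p) {
    labels[p] = argmax_pixel(scores.data(), channels, spatial, p);
  }
  return labels;
}
```

For a 1024x544 input this amounts to 557,056 independent per-pixel scans. Whether a GPU version beats the reported 40 ms would still depend on where the result is needed afterwards, since copying labels back to the host has its own cost.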

@drnikolaev

Hi Timo,
good idea, thank you!

@TimoSaemann
Author

Hi Nikolaev,
are there any new developments?
Can you estimate when the argmax calculation will be implemented on the GPU?
Thanks a lot,
Timo

@drnikolaev

Hi @TimoSaemann
I looked through it and I can't estimate the time, sorry. I'll update this post when I get some news.

@TimoSaemann
Author

Hi @drnikolaev
I just want to mention that I am still interested in an implementation of the argmax.cu layer. I would be really happy to hear about the current status.
Thanks,
Timo

@drnikolaev

Hi @TimoSaemann, it's on my plate, but there are a few urgent bugs to fix before the next release. Sorry about this.
