
CUDA code that approaches cuBLAS performance #255

Open
nyck33 opened this issue Apr 26, 2024 · 0 comments

nyck33 commented Apr 26, 2024

The Colab notebook at https://colab.research.google.com/drive/1RNFSPtD0o9aJFwnqKQSRabODtSZjwPN1, by https://makslevental.github.io/ and based on https://siboehm.com/articles/22/CUDA-MMM, seems quite fast. I'm also looking at https://thunder.snu.ac.kr/?page_id=64&page=6. I'm fishing for opinions: my plan is to follow those write-ups and, as a first step, implement matmul_forward for this repo. If anyone else wants to use these as references, please go ahead.
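
For anyone picking this up, a reasonable first step before chasing cuBLAS numbers is a plain CPU reference to check a new kernel against. Here is a minimal C sketch of that idea, assuming row-major float matrices. The names `matmul_naive`, `matmul_tiled`, `max_abs_diff` and the `tile` parameter are hypothetical; they are not taken from this repo or from the linked articles. The blocked loop only illustrates cache blocking on the CPU and is not the shared-memory kernel from any of the write-ups.

```c
#include <assert.h>
#include <stddef.h>

/* Naive reference: out[m][n] = sum over k of a[m][k] * b[k][n].
   All matrices are row-major (an assumption of this sketch). */
void matmul_naive(float *out, const float *a, const float *b,
                  int M, int N, int K) {
    for (int m = 0; m < M; m++) {
        for (int n = 0; n < N; n++) {
            float acc = 0.0f;
            for (int k = 0; k < K; k++) {
                acc += a[(size_t)m * K + k] * b[(size_t)k * N + n];
            }
            out[(size_t)m * N + n] = acc;
        }
    }
}

/* Blocked variant: walks the problem in tile x tile chunks so each chunk of
   a and b is reused while it is still in cache. Handles sizes that are not
   multiples of tile by clamping the chunk bounds. */
void matmul_tiled(float *out, const float *a, const float *b,
                  int M, int N, int K, int tile) {
    assert(tile > 0);
    for (size_t i = 0; i < (size_t)M * N; i++) {
        out[i] = 0.0f;
    }
    for (int m0 = 0; m0 < M; m0 += tile) {
        int m1 = (m0 + tile < M) ? m0 + tile : M;
        for (int k0 = 0; k0 < K; k0 += tile) {
            int k1 = (k0 + tile < K) ? k0 + tile : K;
            for (int n0 = 0; n0 < N; n0 += tile) {
                int n1 = (n0 + tile < N) ? n0 + tile : N;
                for (int m = m0; m < m1; m++) {
                    for (int k = k0; k < k1; k++) {
                        float av = a[(size_t)m * K + k];
                        for (int n = n0; n < n1; n++) {
                            out[(size_t)m * N + n] += av * b[(size_t)k * N + n];
                        }
                    }
                }
            }
        }
    }
}

/* Largest absolute elementwise difference between two buffers of length n. */
float max_abs_diff(const float *x, const float *y, size_t n) {
    float worst = 0.0f;
    for (size_t i = 0; i < n; i++) {
        float d = x[i] - y[i];
        if (d < 0.0f) d = -d;
        if (d > worst) worst = d;
    }
    return worst;
}
```

The same `max_abs_diff` check could be pointed at the output of a GPU kernel copied back to the host. With general float inputs the two versions accumulate in a different order, so compare against a small tolerance instead of expecting exact equality.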
