Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ThunderKittens Backend #407

Open
AndreSlavescu opened this issue May 13, 2024 · 1 comment
Open

ThunderKittens Backend #407

AndreSlavescu opened this issue May 13, 2024 · 1 comment

Comments

@AndreSlavescu
Copy link

Would it be an idea to define the same kernels that exist in the CUDA backend with ThunderKittens as well? They have cool examples with FlashAttention2 and I think it would be interesting to have as an educational resource as well. Thoughts?

@ademeure
Copy link
Contributor

Yes! I'm planning to look into using ThunderKittens once I've got more time (probably 2nd week of June). I'm not sure there's much point using it for kernels that don't use the tensor core though? But it might allow fusing even more things together (e.g. matmul and fused classifier maybe)

My plan was to mostly focus on making a hyper-optimised path for H100 using TMA though... But we'll see what happens :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants