Tensor Cores #996

Open
lostmsu opened this issue May 5, 2023 · 1 comment
Labels
feature A new feature (or feature request)
Comments

@lostmsu
Contributor

lostmsu commented May 5, 2023

A way to use Tensor Cores is needed. Its design should draw on the family of VectorXXX intrinsics in .NET and/or the Vulkan Cooperative Matrix extension proposed by NVIDIA.

Related CUDA documentation: https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#warp-level-matrix-instructions

This is also mentioned in #923, but the latter is more about support for shorter floats in general.
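
To make the request more concrete: the warp-level matrix instructions in the CUDA documentation linked above perform a fixed-size tile multiply-accumulate, D = A × B + C. Below is a minimal CPU-only sketch of that arithmetic in C++, assuming a 16×16×16 tile and plain `float` storage for all operands. Real Tensor Core instructions usually take reduced-precision inputs and are executed cooperatively by a whole warp, which this sketch does not model. The names `Tile` and `mma_reference` are hypothetical and are not part of CUDA, .NET, or Vulkan.

```cpp
#include <array>
#include <cstddef>

// Assumed tile shape for illustration only (m = n = k = 16).
constexpr std::size_t M = 16, N = 16, K = 16;

// Hypothetical helper type: a dense row-major tile of floats.
template <std::size_t Rows, std::size_t Cols>
using Tile = std::array<float, Rows * Cols>;

// CPU reference for one tile-level multiply-accumulate: D = A * B + C.
// A is M x K, B is K x N, C and D are M x N, all row-major.
Tile<M, N> mma_reference(const Tile<M, K>& a,
                         const Tile<K, N>& b,
                         const Tile<M, N>& c) {
    Tile<M, N> d{};
    for (std::size_t i = 0; i < M; ++i) {
        for (std::size_t j = 0; j < N; ++j) {
            float acc = c[i * N + j];  // start from the accumulator tile
            for (std::size_t k = 0; k < K; ++k) {
                acc += a[i * K + k] * b[k * N + j];
            }
            d[i * N + j] = acc;
        }
    }
    return d;
}
```

A reference like this could serve as a correctness check for whatever API shape is eventually chosen: run the same tiles through the hardware path and compare against `mma_reference` within a tolerance suited to the input precision.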

@MoFtZ MoFtZ added the feature A new feature (or feature request) label May 5, 2023
@m4rs-mt
Owner

m4rs-mt commented May 9, 2023

Thanks a lot for your feature request. Given the performance improvements that can be achieved using Tensor Cores on NVIDIA hardware, it definitely makes sense to add support for Tensor Cores in 2.0 (which is going to be the next big release after v1.5).

@m4rs-mt m4rs-mt added this to the v2.0 milestone May 9, 2023
3 participants