An extension library of WMMA API (Tensor Core API)
-
Updated
Feb 20, 2024 - Cuda
An extension library of WMMA API (Tensor Core API)
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme
SParse AcceleRation on Tensor Architecture
Fast SGEMM emulation on Tensor Cores
An extension library of WMMA API for single precision matrix operation using TensorCores and error correction technique
Add a description, image, and links to the tensorcores topic page so that developers can more easily learn about it.
To associate your repository with the tensorcores topic, visit your repo's landing page and select "manage topics."