Drop-in GPU acceleration for linear algebra.
Updated Feb 22, 2020 - Fortran
C++ API for the BLAS of Tile Low-rank Matrix Algebra
OpenBLAS is an open source implementation of the BLAS (Basic Linear Algebra Subprograms) API with many hand-crafted optimizations for specific processor types.
Calculate the cumulative sum of strided array elements.
Interchange two vectors.
Interchange two single-precision floating-point vectors.
Calculate the sum of single-precision floating-point strided array elements.
Calculate the sum of single-precision floating-point strided array elements, ignoring NaN values, using extended accumulation, and returning an extended precision result.
Copy values from one complex single-precision floating-point vector to another complex single-precision floating-point vector.
Calculate the sum of single-precision floating-point strided array elements, ignoring NaN values, using ordinary recursive summation with extended accumulation, and returning an extended precision result.
General numerical solution of the 2D Burgers' equation based on an explicit FTCS (forward-time, centered-space) scheme, implemented in serial and parallel. Submitted for AE3-422 High Performance Computing.
Linear algebra (work in progress)
Fast and Scalable Matrix Multiply using Spark, Breeze, and BLAS libraries
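Several of the entries above describe BLAS-style operations on strided arrays: interchanging two vectors, computing a cumulative sum, and summing single-precision elements while ignoring NaN values and accumulating in extended precision. The following is a minimal pure-Python sketch of what those descriptions mean, not the API of any listed project. The function names and argument order are hypothetical, only non-negative strides are handled, and "extended accumulation" is assumed here to mean summing float32 inputs in a float64 accumulator.

```python
from array import array
import math


def strided_swap(n, x, stride_x, y, stride_y):
    """Interchange n elements of x and y in place, stepping by the given strides."""
    ix = iy = 0
    for _ in range(n):
        x[ix], y[iy] = y[iy], x[ix]
        ix += stride_x
        iy += stride_y


def strided_cumsum(n, x, stride_x, out, stride_out):
    """Write the running sum of n strided elements of x into out and return out."""
    total = 0.0
    ix = io = 0
    for _ in range(n):
        total += x[ix]
        out[io] = total
        ix += stride_x
        io += stride_out
    return out


def strided_nansum_extended(n, x, stride_x):
    """Sum n strided float32 elements, skipping NaN, accumulating in float64."""
    total = 0.0  # a Python float is a double-precision value
    ix = 0
    for _ in range(n):
        value = x[ix]
        if not math.isnan(value):
            total += value
        ix += stride_x
    return total


# array('f', ...) stores single-precision floats, standing in for a float32 vector.
x = array('f', [1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = array('f', [10.0, 20.0, 30.0])
strided_swap(3, x, 2, y, 1)  # exchanges x[0], x[2], x[4] with y[0], y[1], y[2]
```

After the call, `x` holds 10, 2, 20, 4, 30, 6 and `y` holds 1, 3, 5: a stride of 2 visits every other element of `x` without copying it into a contiguous buffer first, which is the main reason BLAS-style interfaces take a stride argument.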