LIBSPMV

Overview

LIBSPMV is a proof-of-concept distributed-memory Sparse Matrix-Vector Multiplication (SpMV) library. It currently implements pure-MPI and hybrid MPI+X where X is one of the following:

OpenMP for CPUs
OpenMP offloading for GPUs
SYCL for CPUs and GPUs
CUDA for NVIDIA GPUs

The communication phase of the SpMV kernel is implemented with multiple MPI-based communication models, including blocking and non-blocking collective and point-to-point communication, as well as one-sided communication. Also, the library focuses on optimizing SpMV for symmetric matrices, which often arise in scientific applications. Finally, the library provides an implementation of the Conjugate Gradient (CG) method for solving large sparse linear systems, using the optimized SpMV kernel.

Getting Started

Prerequisites

CMake >= 3.18
C++ compiler with C++17 support
Eigen >= 3.3.9 (this dependency will be deprecated soon)
A BLAS library
MPI implementation that supports the MPI-3.0 standard and is CUDA-aware (in case ENABLE_CUDA=on)

Installation

It is recommended to build LIBSPMV in a separate directory form the source directory. The basic steps for building with CMake are:

Create a build directory, outside of the source directory.
In your build directory run cmake <path-to-libspmv-src>
It is recommended to set options by calling ccmake in your build directory. Alternatively you can use the -DVARIABLE=value syntax in the previous step.
Run make to build.
Run make test to run sanity checks.
Run make install to install in directory defined through the -DCMAKE_INSTALL_PREFIX=<path> option.

Optional features

To enable OpenMP on CPUs, use the -DENABLE_OPENMP=on option.
To enable OpenMP offloading on GPUs, use the -DENABLE_OPENMP_OFFLOAD=on option.
To enable SYCL, use the -DENABLE_DPCPP=on option for Intel DPC++ or -DENABLE_HIPSYCL=on option for hipSYCL. For hipSYCL, also set the HIPSYCL_TARGETS environment variable to select the target devices, e.g., 'omp;cuda:sm_xx'.
To enable GPU offloading with CUDA, use the -DENABLE_CUDA=on option with -DCMAKE_CUDA_ARCHITECTURES=<sm_xx> to select the target device's compute capability.
To enable use of Intel MKL sparse BLAS kernels where possible, use the -DENABLE_MKL=on.

Name		Name	Last commit message	Last commit date
Latest commit History 276 Commits
.github/workflows		.github/workflows
demos		demos
doc		doc
docker		docker
spmv		spmv
tests		tests
.clang-format		.clang-format
.gitignore		.gitignore
.gitlab-ci.yml		.gitlab-ci.yml
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github/workflows

.github/workflows

demos

demos

doc

doc

docker

docker

spmv

spmv

tests

tests

.clang-format

.clang-format

.gitignore

.gitignore

.gitlab-ci.yml

.gitlab-ci.yml

CMakeLists.txt

CMakeLists.txt

LICENSE

LICENSE

README.md

README.md

Repository files navigation

LIBSPMV

Overview

Getting Started

Prerequisites

Installation

Optional features

About

Releases

Packages

Languages

License

Excalibur-SLE/spmv

Folders and files

Latest commit

History

Repository files navigation

LIBSPMV

Overview

Getting Started

Prerequisites

Installation

Optional features

About

Topics

Resources

License

Stars

Watchers

Forks

Languages