Parallel Heterogeneous CPU/GPU computing on diffusion equation with OpenMP, CUDA, Thrust, OpenACC, TBB

This project can use interconnected GPUs by PCIe or Nvlink with P2P connection.

Requirement

CUDA / Thrust / OpenACC

Install NVIDIA HPC kit

https://developer.nvidia.com/hpc-sdk

And setup the CUDA_PATH toward the hpc kit directory

Example:

export CUDA_PATH=/opt/nvidia/hpc_sdk/Linux_x86_64/22.2/cuda

nvcc required for CUDA and thrust
nvc++ required for openACC

Thread building-blocks

sudo apt install libtbb-dev

or

Follow this : https://www.intel.com/content/www/us/en/developer/articles/guide/get-started-with-tbb.html

OpenMP

Any compatible compiler like GNU g++ or LLVM clang++

Build

Each project can be built as library separately :

OpenACC in directory acc
OpenMP in directory omp
Thrust CPU (TBB/OpenMP) in directory thrust_cpu
Thrust GPU (CUDA) in directory thrust_gpu
OpenACC in directory acc

All computation model and library can be built in one cmake but every dependencies is required and the binary will be able to be executed

The cmake will select the specific required compiler for each subproject (g++, clang++, nvcc, nvc++)

C++ 17 was used due to usage of thrust template and the usage of SFINAE template technical style.

If you have configured clang++ to be able to compile cuda code you can replace

-DCMAKE_CUDA_COMPILER=nvcc
by
-DCMAKE_CUDA_COMPILER=clang++

Build

cd c++ && \
cmake \
-DCMAKE_BUILD_TYPE=RelWithDebInfo \
-DCMAKE_CUDA_COMPILER=nvcc \
-B build -S . && \
cmake --build build

Execute

./build/bin/stencil 10000 10000

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

acc

acc

c++

c++

cuda

cuda

include

include

omp

omp

thrust_cpu

thrust_cpu

thrust_gpu

thrust_gpu

Readme.md

Readme.md

download_thrust.sh

download_thrust.sh

Repository files navigation

Parallel Heterogeneous CPU/GPU computing on diffusion equation with OpenMP, CUDA, Thrust, OpenACC, TBB

Requirement

CUDA / Thrust / OpenACC

Thread building-blocks

OpenMP

Build

Build

Execute

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
acc		acc
c++		c++
cuda		cuda
include		include
omp		omp
thrust_cpu		thrust_cpu
thrust_gpu		thrust_gpu
Readme.md		Readme.md
download_thrust.sh		download_thrust.sh

neudinger/equadiffGPU

Folders and files

Latest commit

History

Repository files navigation

Parallel Heterogeneous CPU/GPU computing on diffusion equation with OpenMP, CUDA, Thrust, OpenACC, TBB

Requirement

CUDA / Thrust / OpenACC

Thread building-blocks

OpenMP

Build

Build

Execute

About

Topics

Resources

Stars

Watchers

Forks

Languages