Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CUDA-aware MPI documentation needs some additional details #170

Open
maxpkatz opened this issue Dec 6, 2019 · 1 comment
Open

CUDA-aware MPI documentation needs some additional details #170

maxpkatz opened this issue Dec 6, 2019 · 1 comment

Comments

@maxpkatz
Copy link
Contributor

maxpkatz commented Dec 6, 2019

The section on CUDA-aware MPI could use a little more detail. Some of the changes I want to make:

  • Emphasize that GPUDirect RDMA is not always the right solution for sending messages, especially for bandwidth-bound messages.
  • Standardize on "GPUDirect" versus "staging" terminology as the two paths that can be taken for sending messages, and note that not everyone uses this, so be prepared for some confusion (especially IBM)
  • Describe how to disable the GPUDirect path
  • Describe how to use Spectrum MPI's environment variable PAMI_CUDA_AWARE_THRESH to control the crossover point between the GPUDirect and staging paths
  • Explain a little bit what SMPI is doing under the hood when you have CUDA-aware MPI enabled (specifically, when you're using the PAMI backend), and address some of the requirements this has on the application like not doing any CUDA calls before MPI_Init(). This addresses the problem noted in known issue: cuda hooks / pami #78, which I will still resolve separately with a contribution to the known issues page.
@jczhang07
Copy link

I am interested in the CUDA-aware MPI documentation. Could someone familiar with the topic finish this issue?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants