OLCF/Summit: Issues on running MPICH on Summit #6142

Workaround (example: running two processes)

The workaround is to set the hosts manually, which at least worked on Summit. The following steps worked with MPICH (404cd8a).

  1. Compile MPICH with CUDA+UCX (installed to MPICH_CUDA_PATH).
     # Compiling MPICH also requires newer libtool/autotools etc.
     module load cuda/11.0.3 gcc/9.1.0
     ./configure --with-device=ch4:ucx --prefix=$MPICH_CUDA_PATH --enable-ch4-am-only --enable-gpu-tests-only --with-cuda="$(realpath $(dirname $(which nvcc))/..)" CC=gcc CXX=gcc
  2. Compile MPICH without CUDA (installed to MPICH_NOCUDA_PATH) to get an mpiexec that does not need CUDA.
     module load gcc/9.1.0
     ./configure --with-device=ch4:ucx --prefix=$MPICH_NOCUDA_PATH --enable-ch4-am-…
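
The `--with-cuda` argument in the first configure command derives the CUDA install root from wherever `nvcc` is found. The sketch below isolates that expression so it can be checked before running configure. The function name `cuda_root_from_nvcc` is hypothetical. It also assumes `nvcc` sits in `<cuda root>/bin`, which is the usual toolkit layout but has not been verified for Summit's modules.

```shell
#!/bin/sh
# Hypothetical helper: resolve the CUDA install root from the path to nvcc.
# Mirrors the expression used for --with-cuda:
#   realpath $(dirname $(which nvcc))/..
cuda_root_from_nvcc() {
    # $1: full path to an nvcc binary, e.g. the output of `which nvcc`
    realpath "$(dirname "$1")/.."
}

# Only print when nvcc is actually on PATH (i.e. after `module load cuda/...`).
if command -v nvcc >/dev/null 2>&1; then
    echo "CUDA root: $(cuda_root_from_nvcc "$(command -v nvcc)")"
fi
```

If the printed directory does not contain `include/` and `lib64/` (or the equivalent for the installed toolkit), the `--with-cuda` value may need to be set by hand.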

Answer selected by hzhou

This discussion was converted from issue #4815 on August 30, 2022 15:45.