This repository has been archived by the owner on Mar 15, 2024. It is now read-only.

Single machine multi-GPU training #213

Open
AlexNmSED opened this issue Mar 16, 2023 · 0 comments

Comments

@AlexNmSED

When I train with 4 GPUs on a single machine, I get this error:
RuntimeError: [/pytorch/third_party/gloo/gloo/transport/tcp/pair.cc:575] Connection closed by peer [172.16.173.129]:23211

Can someone help me?

Thank you.
