Issues: NVIDIA/nccl
Issues list
all-reduce slower on v2.20.5 compared to v2.18.5 on AWS g5.48xlarge (8 x A10G)
#1298
opened May 25, 2024 by
abdulfatir
What's the relationship between NCCL protocols and inter-node communication?
#1296
opened May 24, 2024 by
Alex-Wong
Does the ncclBroadcast call return at the same time on different ranks?
#1294
opened May 20, 2024 by
Eiji911
Inquiry about NCCL's Tree Algorithm Performance in Single and Dual Machine Scenarios
#1290
opened May 17, 2024 by
fizzlover
How can I identify level-1 and level-2 NVSwitches in NCCL?
#1286
opened May 14, 2024 by
Ryan201802
Why doesn't NCCL ring all-reduce stream duration scale with the theoretical (N-1)/N?
#1282
opened May 11, 2024 by
CraneQinghe
Why is allgather's busbw slightly worse than allreduce/reducescatter with the same NCCL environment variables?
#1281
opened May 10, 2024 by
pkuleo
Seeking explanations of the terminology in nvtx.h
#1280
opened May 8, 2024 by
ZhiyiHu1999
HGX 2-node test with different NIC topologies and different network card names hangs, no results
#1277
opened May 8, 2024 by
superLiben
[BUG] NCCL 2.20.5 hits "Message truncated : received 1024 bytes instead of 256" error while 2.18.5 does not
#1273
opened Apr 30, 2024 by
shh2000
Only ~783 GB/s out of theoretical 900 GB/s on HGX H100 SXM NVLink4
#1264
opened Apr 24, 2024 by
OrenLeung