Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NVML C2C queries for NVIDIA GPU-CPU links #633

Open
bgoglin opened this issue Oct 30, 2023 · 0 comments
Open

NVML C2C queries for NVIDIA GPU-CPU links #633

bgoglin opened this issue Oct 30, 2023 · 0 comments

Comments

@bgoglin
Copy link
Contributor

bgoglin commented Oct 30, 2023

nvmlDeviceGetFieldValues() can retrieve CPU-GPU link info.

#define NVML_FI_DEV_C2C_LINK_COUNT                    170 //!< Number of C2C Links present on the device
#define NVML_FI_DEV_C2C_LINK_GET_STATUS               171 //!< C2C Link Status 0=INACTIVE 1=ACTIVE
#define NVML_FI_DEV_C2C_LINK_GET_MAX_BW               172 //!< C2C Link Speed in MBps for active links

(from https://github.com/NVIDIA/nvidia-settings/blob/main/src/nvml.h#L1793)

NCCL already uses those at https://github.com/NVIDIA/nccl/blob/master/src/graph/xml.cc#L702

Those links go to the local CPU (PCI local).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant