Issues: NVIDIA/dcgm-exporter
Issues list
dcgmi version and dcgm-exporter version
question
Further information is requested
#319
opened Apr 30, 2024 by nghtm
Failed to add DCGM_EXP_CLOCK_EVENTS_COUNT
bug
Something isn't working
#317
opened Apr 25, 2024 by CodeBrek
Missing NVLINK bandwidth metrics in dcgm-exporter
bug
Something isn't working
#316
opened Apr 25, 2024 by jz543fm
The pod for a given GPU in k8s mode cannot be captured
enhancement
New feature or request
#314
opened Apr 12, 2024 by rokkiter
dcgm-exporter is not working on ec2 g5.48xlarge nodes
bug
Something isn't working
#313
opened Apr 11, 2024 by eselyavka
Extremely high GPU temperature reported by dcgm-exporter
bug
Something isn't working
#312
opened Apr 11, 2024 by age9990
can't get DCGM_EXP_XID_ERRORS_COUNT metrics
bug
Something isn't working
#310
opened Apr 8, 2024 by homily707
Support collect detail error message with the xid
enhancement
New feature or request
#308
opened Apr 7, 2024 by zhucan
Per pod metrics not exposed with time-slicing enabled
bug
Something isn't working
time-slicing
#307
opened Apr 5, 2024 by ThisIsQasim
How to get current device MIG model is single or mixed?
action_required_from_requester
question
Further information is requested
#293
opened Mar 19, 2024 by lengrongfu
memory leak
action_required_from_requester
bug
Something isn't working
#292
opened Mar 19, 2024 by computerixxx
Expose Container info for MIG enabled GPU
bug
Something isn't working
#272
opened Feb 27, 2024 by krishh85
Collect container name even when not using K8S
enhancement
New feature or request
#238
opened Jan 22, 2024 by BryanQuigley
Should DCGM_FI_DEV_COUNT metric be a counter or a gauge?
action_required_from_requester
#229
opened Dec 29, 2023 by iliakur
Why gpu drain state is not included in the dcgm-exporter
enhancement
New feature or request
#213
opened Dec 1, 2023 by lordofire
Support for reporting FP8 and Transformer Engine usage on H100 GPU's (repost from DCGM Github)
#173
opened Jun 30, 2023 by hassanbabaie
Helm: exporter-metrics-config-map should not be applied by default
enhancement
New feature or request
#170
opened Jun 16, 2023 by maingoh
Running dcgm exporter without root privileges
enhancement
New feature or request
#143
opened Mar 10, 2023 by thekuffs