Alert on "GPU has fallen off the bus" message #17331
andy108369
started this conversation in
Ideas
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi, we have the NVIDIA 4090 GPU has fallen off the bus on one of the nodes plugged into the Netdata, however there are no reports nor alerts in the Netdata regarding this:
Can you please add this to the alerting? This is highly critical to us since we are the GPU cloud provider.
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions