New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
VerneMQ connections not closing causing it to crash #2037
Comments
Check what the upstream network components do. Do you have any idle timeouts configured in a firewall? 👉 Thank you for supporting VerneMQ: https://github.com/sponsors/vernemq |
I don't think this is a Network issue. Updated the post. |
Bump |
Does this look like a reconnect overload? Do you actually see CPU maxed, maybe running on a supertiny instance? 👉 Thank you for supporting VerneMQ: https://github.com/sponsors/vernemq |
Environment
Actual behaviour
I have a VerneMQ broker running with metrics enabled, and I visualize these on a Grafana dashboard.
I have over 5200 devices connected to that broker. These devices restart their connections every approximately 16 hours.
Usually everything goes smoothly, devices restart their connections, they reconnect normally, and no issues seem to happen with the broker.
However, sometimes around restarts, the following happens:
Socket Open almost becomes the double of Socket Close:
Consequently the number of connected clients keeps going up until broker stops functioning properly:
Around that time I can observe the following errors in the logs :
After that happens, all clients are unable to connect to the broker.
Unfortunately I was not able to reproduce the issue yet as it happens haphazardly.
Update:
Also found out the following happens after the issue starts happening:
Messages get queued up:
Commands that would usually take 1 second to return start timing out after 1 minute.
The text was updated successfully, but these errors were encountered: