-
Notifications
You must be signed in to change notification settings - Fork 995
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Heartbeat table updating. Positions table not. Logs show successfully processing 0 records. #2010
Comments
is data flowing during this time? Ie is the whole system stuck or just heartbeats? |
encounter the same problem The data flows normally |
In my case, the logs always show 0 UserRecords and 0 KinesisRecords even though rows are being updated in the database.
Also, the Maxwell's But the |
definitely sounds like maxwell's binlog replication thread is stuck or dead; the fact that the Can you capture complete logs from the system? The kinesis logs you're showing don't necessarily mean that much, but maybe there's some errors or warnings or info before that? Curious when you say "after a database restart" -- does maxwell not die when the database first restarts? or is this when maxwell restarts after the db restarts? If all else fails you might want to try running with |
Nothing wrong with the console interface ,When restart maxwell ,The service becomes normal |
There's no obvious error log on the console, either |
I don't have the full logs but I did read them and nothing jumped out.
We have it set to unlimited reconnection retries. I think I saw it retry a few thousand times in a minute (those numbers are from memory, don't count on them being accurate) or so and then when the database came back up it reconnected and started logging as if everything was A-OK.
I'll give this a shot. Thanks! One other note: this isn't 100% reproducible for us. Sometimes after Maxwell can't connect for a ~minute it will reconnect and continue processing data normally. Rough estimate is that we've seen it break ~3 times out of ~20 database restarts. Perhaps of note is that usually when we do a database restart it's because we're resizing an RDS instance. I didn't expect that to be an issue because we're not upgrading MySQL, changing IP address, or doing anything else that I'd expect to affect the connection/binlog. If nothing jumps out as an "ah-hah" from this description, I can try to narrow in on making it reproducible. |
We've encountered this at least twice. Both times after a database restart. We're on RDS. The first time was a resize. The second time was a change to a parameter group (or something like that).
We're running in the
zendesk/maxwell:v1.37.6
docker image.docker logs --follow
shows this and it updates in realtime:Running
SELECT * FROM maxwell.heartbeats;
a few times shows the heartbeat updating.Running
SELECT * FROM maxwell.positions;
repeatedly never shows any update. It's stuck at alast_heartbeat_read
and abinlog_file
from hours ago.docker restart
fixes the issue.Any ideas? Threads to pull on? More info I can provide?
Thanks for any help.
The text was updated successfully, but these errors were encountered: