New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
1.26 on hash consumed almost all memory and CPU, did not process blocks #6965
Comments
This happened again on another machine with a Lodestar CL. It's unlikely be an interop issue then. I will remove |
Hi, do you have log with the Lodestar CL? Also, can you double check that the CL is synced? |
We have the logs in Loki, let me see what I can pull out. The CL is synced, yes. Since removing |
…eck to greater-equal (#1808)
May I know why |
Yes and yes. The intent was to allow the auto prune to migrate. Have not seen another failure since removing the pruning cache parameter |
Did not reproduce when forward syncing. Problem may come from outside block processing. |
I’ve seen this once more, it just took those 2 weeks. I’ve configured one of my servers with the cache parameter and debug logs, to hopefully catch this when it happens |
Are those running hash also or halfpath? |
This is hash, with the HalfPath parameter to convert during pruning. |
Are these node validator? |
Yes, they are part of my Lido setup. Nimbus or Lodestar depending on the server; 1,000 validators via Vouch per cluster. Means there is a decent amount of block building going on. |
Hmm... simply re-running blocks with block producer's block processor does not work. Need actual block production. |
Description
Nethermind
1.26
did not process blocks and gradually consumed all memory, and had high CPU.A restart fixed it, it caught up again. It would not quit cleanly however, Docker reaped it after the 5min configured timeout for container stop.
CL is Nimbus
v24.4.0
Logs don't appear to show a clear root cause.
Nethermind is started with these pruning parameters. It was not running a Full Prune at the time.
--Pruning.FullPruningMaxDegreeOfParallelism=3 --Pruning.FullPruningTrigger=VolumeFreeSpace --Pruning.FullPruningThresholdMb=375810 --Pruning.CacheMb=4096 --Pruning.FullPruningMemoryBudgetMb=16384 --Init.StateDbKeyScheme=HalfPath
Full startup parameters as shown by
ps auxww
:Steps to Reproduce
Unsure
Desktop (please complete the following information):
Please provide the following information regarding your setup:
v24.4.0
Logs
Nimbus logs
Logs of Nethermind in its failure state
Logs of Nethermind after restart, when it catches up
nimbus.log.gz
nm-hang.log.gz
nm-catchup.log.gz
The text was updated successfully, but these errors were encountered: