el_manager initial refactor. #6228

cheatfate · 2024-04-22T14:47:08Z

Goals

Replace all usages of "helpers" with proper primitives.
Add more error handlers and more error reporting (mostly at DEBUG level).
Adopt asyncraises usage.

beacon_chain/el/el_manager.nim

github-actions · 2024-04-22T17:11:54Z

Unit Test Results

9 files ±0 · 1 319 suites ±0 · 25m 27s ⏱️ (- 8m 12s)
4 982 tests ±0: 4 634 ✔️ ±0, 348 💤 ±0, 0 ❌ ±0
20 802 runs ±0: 20 398 ✔️ ±0, 404 💤 ±0, 0 ❌ ±0

Results for commit 6def75b. ± Comparison against base commit 8ca537c.

♻️ This comment has been updated with latest results.

etan-status · 2024-04-25T22:26:40Z

beacon_chain/el/el_manager.nim

-      asyncSpawn connection.close()
-      connection.web3 = none[Web3]()
-  of Degraded:
+      await connection.close()


Could this increase the delay until a fallback client is used? close may run for 30s, and with await no progress is made during that time.

The new version is at least cleaner in tracking what is going on; the old version could possibly run into a situation where multiple close processes were running at the same time, I guess.

cheatfate, Apr 28, 2024: Every possible async task could run for 30s; that's not how we should protect the code from running for 30s.

cheatfate, Apr 28, 2024: Procedures should not spawn a close procedure without waiting for it to actually complete. As we have seen many times before, it's not a good practice: it leads to leaks and UB (when the FD of a just-closed transport is reused by the OS, you will have 2 transports with the same FD in the process, one still closing and another just opened).
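The difference between the two patterns in this thread can be sketched as follows. This is a minimal illustration, assuming chronos is installed; `Conn`, `fireAndForget` and `closeAndWait` are hypothetical stand-ins, not code from el_manager.nim.

```nim
import chronos

type
  Conn = ref object        # hypothetical stand-in for a connection object
    closed: bool

proc close(c: Conn) {.async.} =
  # Simulate a close handshake that takes some time to complete.
  await sleepAsync(10.milliseconds)
  c.closed = true

proc fireAndForget(c: Conn) =
  # Old pattern: the caller continues while close() is still in flight.
  asyncSpawn c.close()

proc closeAndWait(c: Conn) {.async.} =
  # New pattern: the caller resumes only after close() has finished.
  await c.close()

let a = Conn()
fireAndForget(a)
let closedRightAfterSpawn = a.closed   # close() has not finished yet

let b = Conn()
waitFor closeAndWait(b)
let closedAfterAwait = b.closed        # close() has finished
```

With `asyncSpawn` the caller cannot tell when the resource is really released, which is the window in which the reuse problem described above can occur.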

tersec · 2024-04-26T12:30:50Z

beacon_chain/el/el_manager.nim


 # TODO can't be defined within exchangeConfigWithSingleEL
 func `==`(x, y: Quantity): bool {.borrow.}

-proc exchangeConfigWithSingleEL(m: ELManager, connection: ELConnection) {.async.} =
+proc exchangeConfigWithSingleEL(


This is a bit of a misnomer now -- it hasn't really been exchangeConfigWithSingleEL since #5585 and #5889.

cheatfate, Apr 26, 2024: What is your name proposal for it? It still performs the network_id check.

tersec, May 10, 2024: Sure, so checkNetworkIdWithSingleEL, say, or checkChainIdWithSingleEL.

tersec · 2024-04-26T12:32:53Z

beacon_chain/el/el_manager.nim

@@ -1763,18 +1949,27 @@ func hasProperlyConfiguredConnection*(m: ELManager): bool =

  false

-proc startExchangeTransitionConfigurationLoop(m: ELManager) {.async.} =
+proc startExchangeTransitionConfigurationLoop(


Also a misnomer since #5585 and #5889, along with the debug log message, etc.

It checks the chain ID now; it does not exchange transition configuration.

tersec · 2024-04-26T12:35:35Z

beacon_chain/el/el_manager.nim

+      pending.add(m.chainSyncingLoopFut.cancelAndWait())
+    if not(m.exchangeTransitionConfigurationLoopFut.isNil()) and
+       not(m.exchangeTransitionConfigurationLoopFut.finished()):
+      pending.add(m.exchangeTransitionConfigurationLoopFut.cancelAndWait())


Do the waits here delay clean database closing in case of stuck connections, or at least allow the database to close cleanly?

cheatfate, Apr 26, 2024: These waits are proper cancellation; you should wait until the loops have actually been cancelled.

tersec, Apr 26, 2024 (edited): Ideally, yes. My question is whether, in the case where it takes arbitrarily long to finish the loops, the database cleanup still happens first. If it doesn't, then it places the user in a less than great position. Ultimately the Chronos state is ephemeral and the state which has to remain intact for the next run is in the database.

cheatfate, Apr 28, 2024: If you want to close the database first, do it first. If you want to work properly in the async world, you should signal your workers and wait for them to complete their own cleanup processes. The overall construction of the exit procedure in BN is incorrect, because it does not allow tasks to perform any cleanup procedures. You are working with the database in a non-async way; if you think that your first task is to close the database, do it before signaling async tasks to finish.

tersec, May 2, 2024: Fair enough, and out of scope of this PR.
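The "signal your workers and wait for them" approach from this thread can be sketched roughly as below. This is an illustration only, assuming chronos is installed; `LoopState`, `worker` and `stopLoops` are hypothetical names, and the real stop procedure in el_manager.nim may differ.

```nim
import chronos

type
  LoopState = ref object   # hypothetical bookkeeping for this example
    cleanedUp: bool

proc worker(state: LoopState) {.async.} =
  try:
    while true:
      await sleepAsync(1.seconds)
  finally:
    # Runs when the loop is cancelled, so the task can clean up after itself.
    state.cleanedUp = true

proc stopLoops(loops: seq[Future[void]]) {.async.} =
  var pending: seq[FutureBase]
  for fut in loops:
    if not(fut.isNil()) and not(fut.finished()):
      # Request cancellation and keep a future that completes once the
      # loop has really stopped.
      pending.add(FutureBase(fut.cancelAndWait()))
  # Resume only after every loop has finished its own cleanup.
  await allFutures(pending)

let state = LoopState()
let loopFut = worker(state)
waitFor stopLoops(@[loopFut])
```

Under this sketch, anything that must happen regardless of how long the loops take to stop (such as closing the database) would have to be done before calling `stopLoops`, which is the ordering question raised above.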

tersec reviewed Apr 22, 2024 (four comments on beacon_chain/el/el_manager.nim, now outdated)

cheatfate force-pushed the el-refactoring branch from 6e019b9 to 763e631 Compare April 22, 2024 15:36

etan-status reviewed Apr 25, 2024

cheatfate force-pushed the el-refactoring branch 2 times, most recently from 80229c5 to 3baa56b Compare April 26, 2024 11:52

tersec reviewed Apr 26, 2024

cheatfate added 7 commits May 13, 2024 17:14

Initial commit.

05246cf

Address review comments and fix missing primitive.

10d45bb

Fix developer build.

3a69921

More asyncraises updates.

21e6c2d

Refactor and optimize forkchoiceUpdated() and sendNewPayload().

f122b6b

Fix runtime assertion.

e6d2bf0

Refactor getPayload().

6def75b

cheatfate force-pushed the el-refactoring branch from 3baa56b to 6def75b Compare May 13, 2024 14:15

arnetheduck approved these changes May 14, 2024

tersec merged commit e6b9bfc into unstable May 14, 2024
14 checks passed

tersec deleted the el-refactoring branch May 14, 2024 18:03
