feat: Eliminate addressBook from Network components #13010

kfa-aguda · 2024-04-25T21:55:26Z

closes #12984

Signed-off-by: Kore Aguda <kore@swirldslabs.com>

This reverts commit f23c2ac. Signed-off-by: Kore Aguda <kore@swirldslabs.com>


github-actions · 2024-04-25T22:27:15Z

Node: HAPI Test (Restart) Results

2 tests 2 ✅ 5m 36s ⏱️
2 suites 0 💤
2 files 0 ❌

Results for commit dd516d0.

♻️ This comment has been updated with latest results.

github-actions · 2024-04-25T22:27:47Z

Node: HAPI Test (Node Death Reconnect) Results

1 tests 1 ✅ 24s ⏱️
1 suites 0 💤
2 files 0 ❌
1 errors

For more details on these parsing errors, see this check.

Results for commit dd516d0.


github-actions · 2024-04-25T22:37:51Z

Node: HAPI Test (Token) Results

228 tests 227 ✅ 17m 37s ⏱️
16 suites 1 💤
16 files 0 ❌

Results for commit dd516d0.


github-actions · 2024-04-25T22:55:43Z

Node: HAPI Test (Misc) Results

457 tests 447 ✅ 35m 42s ⏱️
76 suites 10 💤
77 files 0 ❌
1 errors

For more details on these parsing errors, see this check.

Results for commit dd516d0.


github-actions · 2024-04-25T22:58:24Z

Node: HAPI Test (Crypto) Results

335 tests 335 ✅ 36m 23s ⏱️
25 suites 0 💤
25 files 0 ❌

Results for commit dd516d0.


github-actions · 2024-04-25T23:14:34Z

Node: HAPI Test (Time Consuming) Results

21 tests 21 ✅ 54m 18s ⏱️
3 suites 0 💤
3 files 0 ❌

Results for commit dd516d0.


github-actions · 2024-04-25T23:19:56Z

Node: Unit Test Results

2 267 files ±0 2 267 suites ±0 3h 27m 7s ⏱️ + 25m 0s
112 332 tests ±0 112 265 ✅ ±0 67 💤 ±0 0 ❌ ±0
120 795 runs ±0 120 728 ✅ ±0 67 💤 ±0 0 ❌ ±0

Results for commit dd516d0. ± Comparison against base commit b1ad671.

This pull request removes 4002 and adds 3765 tests. Note that renamed tests count towards both.


  
             IssuerDN: CN=s-aaaa
            SubjectDN: CN=s-aaaa
           Final Date: Fri Jan 01 00:00:00 UTC 2100
           Public Key: RSA Public Key [2e:28:bc:1e:d3:83:25:92:8e:cb:98:b1:b6:84:06:9c:d5:d8:14:d5],[56:66:d1:a4]
           Start Date: Sat Jan 01 00:00:00 UTC 2000
         SerialNumber: 12482092706667292405
        modulus: c1a0ff5d2372b53d12d12bb87dd03f5e…
        modulus: c1a0ff5d2372b53d12d12bb87dd03f5…
…

com.hedera.node.app.grpc.impl.netty.GrpcServiceBuilderTest ‑ [4] 

com.hedera.node.app.grpc.impl.netty.GrpcServiceBuilderTest ‑ [6] 

com.hedera.node.app.grpc.impl.netty.GrpcServiceBuilderTest ‑ [7]   
  
com.hedera.node.app.service.mono.state.codec.VirtualKeySerdesAdapterTest ‑ [10] com.hedera.node.app.service.mono.state.codec.VirtualBlobKey@2cb0b062
com.hedera.node.app.service.mono.state.codec.VirtualKeySerdesAdapterTest ‑ [11] com.hedera.node.app.service.mono.state.codec.VirtualBlobKey@51e97a97
com.hedera.node.app.service.mono.state.codec.VirtualKeySerdesAdapterTest ‑ [12] com.hedera.node.app.service.mono.state.codec.VirtualBlobKey@c72d914e
com.hedera.node.app.service.mono.state.codec.VirtualKeySerdesAdapterTest ‑ [13] com.hedera.node.app.service.mono.state.codec.VirtualBlobKey@26d8436a
com.hedera.node.app.service.mono.state.codec.VirtualKeySerdesAdapterTest ‑ [14] com.hedera.node.app.service.mono.state.codec.VirtualBlobKey@81725f31
com.hedera.node.app.service.mono.state.codec.VirtualKeySerdesAdapterTest ‑ [15] com.hedera.node.app.service.mono.state.codec.VirtualBlobKey@b0ed13b7
com.hedera.node.app.service.mono.state.codec.VirtualKeySerdesAdapterTest ‑ [16] com.hedera.node.app.service.mono.state.codec.VirtualBlobKey@a00f2fc7
…


github-actions · 2024-04-25T23:27:51Z

Node: HAPI Test (Smart Contract) Results

585 tests 585 ✅ 1h 5m 10s ⏱️
62 suites 0 💤
62 files 0 ❌

Results for commit dd516d0.


cody-littley · 2024-04-26T12:21:35Z

...wirlds-platform-core/src/main/java/com/swirlds/platform/network/topology/StaticTopology.java

+        this.peerNodes =
+                Objects.requireNonNull(peers.stream().map(PeerInfo::nodeId).toList(), "peers must not be null");


The Objects.requireNonNull() is not necessary. If peers is null then peers.stream() will throw an exception first.

good point.

change not made in latest commit
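To make the point in this thread concrete, here is a minimal sketch of the difference between checking the result of the stream pipeline and checking the argument itself. `NullCheckSketch` and its nested `NodeId` and `PeerInfo` records are hypothetical stand-ins, not the platform's real types.

```java
import java.util.List;
import java.util.Objects;

final class NullCheckSketch {
    // Hypothetical stand-ins for the platform's NodeId and PeerInfo types.
    record NodeId(long id) {}
    record PeerInfo(NodeId nodeId) {}

    /** Checks the argument itself, so a null list fails with the intended message. */
    static List<NodeId> toNodeIds(final List<PeerInfo> peers) {
        Objects.requireNonNull(peers, "peers must not be null");
        return peers.stream().map(PeerInfo::nodeId).toList();
    }

    /** Mirrors the reviewed line: peers.stream() throws before requireNonNull can run. */
    static List<NodeId> toNodeIdsAsReviewed(final List<PeerInfo> peers) {
        return Objects.requireNonNull(
                peers.stream().map(PeerInfo::nodeId).toList(), "peers must not be null");
    }
}
```

In the second method the custom message is never used for a null `peers`, because `Stream.toList()` does not return null and the dereference of `peers` fails first.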

cody-littley · 2024-04-26T12:25:04Z

...wirlds-platform-core/src/main/java/com/swirlds/platform/network/topology/StaticTopology.java

+     * @return the index of the node in the peer list
+     */
+    private int getIndexOfNodeId(@NonNull final NodeId nodeId) {
+        final int index = peerNodes.indexOf(nodeId);


Instead of iterating the list to find the index, we should instead create a Map<NodeId, Integer> for O(1) lookup time.

O(1) here would be nice, but I still have to keep a list of peers for the neighbors. To use a map, the values of the map would be a set of peers, which must still be converted to a list in O(n). There is no good trade-off here.

O(1) is non-optional for stuff like this.

Perhaps we can get O(1) with a much simpler approach. Is there any reason we can't just compare node IDs directly? The index is really just a proxy for node IDs. All we really want to do is figure out whether the other node comes first in the address book, and if you have a lower node ID you will always come first in the address book.
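A minimal sketch of the direct comparison proposed above, next to the list-based lookup it would replace. It assumes node IDs ascend with address book position, which this thread asserts but the code below cannot verify; `OrderingSketch` and its `NodeId` record are hypothetical stand-ins.

```java
import java.util.List;

final class OrderingSketch {
    // Hypothetical stand-in for the platform's NodeId type.
    record NodeId(long id) {}

    /** O(n): finds both positions by scanning the ordered list. */
    static boolean comesBeforeByIndex(final List<NodeId> ordered, final NodeId other, final NodeId self) {
        return ordered.indexOf(other) < ordered.indexOf(self);
    }

    /** O(1): compares the IDs directly; only valid if IDs ascend with list position. */
    static boolean comesBeforeById(final NodeId other, final NodeId self) {
        return other.id() < self.id();
    }
}
```

For a list sorted by ascending ID, both methods agree on every pair of distinct nodes, so the index lookup adds no information.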

alittley

Why have the files sigCerts.data and agrCerts.data been added in this PR?

alittley · 2024-04-26T13:04:26Z

...wirlds-platform-core/src/main/java/com/swirlds/platform/network/topology/StaticTopology.java

@@ -76,19 +73,16 @@ public List<NodeId> getNeighbors() {
     */
    @Override
    public List<NodeId> getNeighbors(final Predicate<NodeId> filter) {


This method used to return neighbors, but now it's returning all peers.

The peers are all neighbors, but not necessarily adjacent.

I don't understand this. If all peers are neighbors, why do we have two terms here? To me, "neighbor" implies "adjacent".

alittley · 2024-04-26T13:33:34Z

...wirlds-platform-core/src/main/java/com/swirlds/platform/network/topology/StaticTopology.java

+     * @param nodeId the node ID
+     * @return the index of the node in the peer list
+     */
+    private int getIndexOfNodeId(@NonNull final NodeId nodeId) {


Won't this method have an off by 1 error for every node with index > self index? You're using position in peerNodes, but self is missing.

For every node with index > self index, we need the index returned by this method to be off by 1 (so that it matches the node's position in the original addressBook).
There's an edge case where being off by 1 can lead to this method returning an index greater than the peer list's upper bound, but we handle that correctly in the isAdjacent check, so it's OK.

I don't understand this either. The previous code in this class did the following to get the index of the other node in the address book. This computation includes the self ID in the indices:

addressBook.getIndexOfNodeId(nodeId);

But the new code instead defines the index of the other node via its position in the peer list, which doesn't contain self id. So for every node with index greater than this self id, the returned node index will be 1 less.

Ok, I think I understand why this works.

If the node ID of the neighbor is greater than ours, then this method returns a value that is one less than its actual index. If the node ID of the neighbor is equal to our node ID plus one, getIndexOfNodeId() increments the number so that it doesn't actually equal our index.

This class still returns the correct results because we only care if the other node index is greater or less than our own index, so it doesn't matter that it's off by 1 as long as it is greater than our index.

Although this technically yields the correct result, I feel like this is one of those times where the code is "accidentally returning the right results". At the very least, the code here does not return values consistent with the javadocs. Better to refactor this code I think. I can imagine that the reason why/how this works may not be apparent to a developer who looks at this in the future.
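One possible shape for the refactoring suggested above, offered only as a sketch: make the offset explicit so the returned value matches the address book index that the javadoc promises. It assumes the peer list keeps address book order with self removed; `PeerIndexSketch`, `addressBookIndex`, and the `NodeId` record are hypothetical names, not code from this PR.

```java
import java.util.List;

final class PeerIndexSketch {
    // Hypothetical stand-in for the platform's NodeId type.
    record NodeId(long id) {}

    /**
     * Converts a position in the self-less peer list back to an address book index.
     * Positions at or after selfIndex are shifted by one to account for the missing self entry.
     */
    static int addressBookIndex(final List<NodeId> peersWithoutSelf, final int selfIndex, final NodeId nodeId) {
        final int position = peersWithoutSelf.indexOf(nodeId);
        if (position < 0) {
            throw new IllegalArgumentException("unknown peer: " + nodeId);
        }
        return position >= selfIndex ? position + 1 : position;
    }
}
```

With an address book of IDs 0 through 4 and self at index 2, the peer list is `[0, 1, 3, 4]`: node 3 sits at position 2 in the peer list, and the shift restores its address book index of 3. The `indexOf` call is still O(n); the sketch is only about the offset.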

cody-littley · 2024-04-26T13:41:15Z

...wirlds-platform-core/src/main/java/com/swirlds/platform/network/topology/StaticTopology.java

Why are we modifying this class right now? We discussed as a group a few weeks back, and as I recall the decision was to make the network fully connected and not to try and modify the partial connectivity code until we had the bandwidth to test it properly.

The danger of modifying this code now is that we don't actually have any integration tests where the network is not fully connected, meaning we may be introducing bugs we don't catch for a long time.

All this change does is remove the addressBook from the network code so that we operate on peers, which makes tests much easier to write. The AB limits how effectively we can write tests, and it is also a completely orthogonal concept to the network code. I have deliberately made some choices to closely mimic the old functionality (until we decide to revisit fully connected vs. otherwise), such as doing an index lookup via iteration rather than a map.

like doing an index lookup via iteration vs map

The existing code uses AddressBook::getIndexOfNodeId to determine node index, which has a map lookup under the hood. Where are you seeing index lookup via iteration?

StaticTopology.getNeighbors is one.

Anyway, doing away with node indices altogether has made this a lot easier as we no longer have to match the old behavior.

To fully do away with node indices, we have to drop the use of RandomGraph and officially support only fully connected networks until we introduce a new implementation that connects to fewer peers than the number of nodes in the address book.

lpetrovic05 · 2024-04-26T13:42:54Z

platform-sdk/swirlds-platform-core/src/main/java/com/swirlds/platform/gossip/SyncGossip.java

        final List<PeerInfo> peers = Utilities.createPeerInfoList(addressBook, selfId);
+
+        topology =
+                new StaticTopology(random, peers, addressBook.getIndexOfNodeId(selfId), basicConfig.numConnections());


This index is an AB concept, so we should get rid of it.

@lpetrovic05 Is the scope of this PR to simply remove the address book references or also drop our support for non-fully connected network topologies? The RandomGraph class uses hard-coded arrays of size addressbook.getSize(). The translation from id to index is to support the use of RandomGraph to decide who is a neighbor when there are more nodes in the network than the degree of edges allowed per node.

If we drop the use of RandomGraph, then we can get rid of the translation from id to index. But if we keep RandomGraph, then we still need to translate from id to index. We can do it without the address book, using the List<PeerInfo>, but we'll have to reconstruct the Map<id, index>, and be sensitive to where the selfId falls into the sequence.

It is also important to note that RandomGraph is returned and used in the FallenBehindManager where it gets the neighbors of the node based off of the selfId's mapping to index in the address book. To truly drop RandomGraph, we'd need to supply a replacement in other locations where it is used.
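If RandomGraph stays, the ID-to-index translation described above could be rebuilt without the address book roughly as follows. This is a sketch under the assumption that index order equals ascending node ID order; `IndexMapSketch`, `indexAllNodes`, and the `NodeId` record are hypothetical.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class IndexMapSketch {
    // Hypothetical stand-in for the platform's NodeId type.
    record NodeId(long id) {}

    /**
     * Rebuilds an ID-to-index map from the peers plus self.
     * Self is inserted at its sorted position, so peer indices after it are not shifted down.
     */
    static Map<NodeId, Integer> indexAllNodes(final List<NodeId> peers, final NodeId selfId) {
        final List<NodeId> all = new ArrayList<>(peers);
        all.add(selfId);
        all.sort(Comparator.comparingLong(NodeId::id));
        final Map<NodeId, Integer> indexById = new HashMap<>();
        for (int i = 0; i < all.size(); i++) {
            indexById.put(all.get(i), i);
        }
        return indexById;
    }
}
```

The map is built once, so each later lookup is O(1), and its size matches the number of nodes that a fixed-size graph would expect.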

kfa-aguda · 2024-04-26T14:29:01Z

Why have the files sigCerts.data and agrCerts.data been added in this PR?

I need the peers to have certificates (though the certificates don't have to be valid in this case). The certificate generator knows to look for those resource files, so it was easier to just give it the files. The alternative is to get the peers and add mock certificates to them, which is kind of hacky.

Signed-off-by: Kore Aguda <kore@swirldslabs.com>

edward-swirldslabs · 2024-05-03T17:47:38Z

...wirlds-platform-core/src/main/java/com/swirlds/platform/network/topology/StaticTopology.java

+    @NonNull
+    private Map<NodeId, Long> map(@NonNull final List<PeerInfo> peers) {
+        for (final PeerInfo peer : peers) {
+            peerNodeToIdMap.put(peer.nodeId(), peer.nodeId().id());


This definition of a map is not meaningful. If you have the key, then you can call .id() and get the value without using the map.

I think your intent may be to create a new index ordering based on PeerInfo and map to indexes for use in compatibility with RandomGraph?

edward-swirldslabs · 2024-05-03T17:49:24Z

...wirlds-platform-core/src/main/java/com/swirlds/platform/network/topology/StaticTopology.java

-        final int selfIndex = addressBook.getIndexOfNodeId(selfId);
-        final int nodeIndex = addressBook.getIndexOfNodeId(nodeId);
-        return connectionGraph.isAdjacent(selfIndex, nodeIndex);
+        return peerNodeToIdMap.containsKey(nodeId);


You've dropped the isAdjacent check with connectionGraph. If the PR is just to remove the address book dependency then you still need to query the index from the map you construct once it has the right value in the key-value pair.

edward-swirldslabs · 2024-05-03T17:50:38Z

...wirlds-platform-core/src/main/java/com/swirlds/platform/network/topology/StaticTopology.java


 /**
 * A bidirectional topology that never changes.
 */
 public class StaticTopology implements NetworkTopology {
    private static final long SEED = 0;

-    private final NodeId selfId;
+    /** nodes are mapped so lookups are efficient. **/
+    private Map<NodeId, Long> peerNodeToIdMap = new HashMap<>();


This variable would be more appropriately named nodeIdToIndexMap, assuming the intent is to replace the address book lookup for the index.

edward-swirldslabs · 2024-05-03T17:51:41Z

...wirlds-platform-core/src/main/java/com/swirlds/platform/network/topology/StaticTopology.java

     * @param selfId            the ID of this node
     * @param numberOfNeighbors the number of neighbors each node should have
     */
    public StaticTopology(
            @NonNull final Random random,
-            @NonNull final AddressBook addressBook,
+            @NonNull final List<PeerInfo> peers,


Is there an invariant on this list that the PeerInfo is in monotonically ascending order with respect to NodeId?

Yes, there is an invariant, but it's that the PeerInfo entries are ordered as they appear in the original addressBook. I understand they should be monotonically ascending as defined in the AB, but I don't know if that's mandated anywhere.
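Since the ordering is not known to be mandated, one option is to check it explicitly where the list is consumed. A minimal sketch, with hypothetical names (`OrderInvariantSketch`, `isStrictlyAscending`, and a stand-in `NodeId` record):

```java
import java.util.List;

final class OrderInvariantSketch {
    // Hypothetical stand-in for the platform's NodeId type.
    record NodeId(long id) {}

    /** Returns true if every ID is strictly greater than the one before it. */
    static boolean isStrictlyAscending(final List<NodeId> ids) {
        for (int i = 1; i < ids.size(); i++) {
            if (ids.get(i - 1).id() >= ids.get(i).id()) {
                return false;
            }
        }
        return true;
    }
}
```

A constructor could reject a list that fails this check, which would turn the assumed invariant into an enforced one.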

edward-swirldslabs · 2024-05-03T18:02:55Z

...wirlds-platform-core/src/main/java/com/swirlds/platform/network/topology/StaticTopology.java

-                .filter(filter)
-                .toList();
+    public Set<NodeId> getNeighbors() {
+        return peerNodeToIdMap.keySet();


In the previous implementation, this method called the overload that takes a predicate argument. That overload has been deleted. In the deleted implementation, the definition of a neighbor was scoped to the connectionGraph, which is a RandomGraph that also models the allowed degree (number of edges) of each node. If there are 100 nodes and each node has degree 10 in the graph, then each node has 10 neighbors. The old implementation would provide the 10 neighbors a node is allowed to talk to. This implementation treats every node in the List<PeerInfo> as a node it is allowed to talk to.

Discussed with @edward-swirldslabs. The StaticTopology implementation, given nodes ranging from 1-100, returned what's equivalent to the peersList (i.e. the list of peers) all the time. There's a specific test case for this. Agreed it did that based on what's returned by connectionGraph.getNeighbors() but this PR trivially returned the peers list, which is equivalent. It may be helpful to have a discussion on why that may not be right.

I understand there have been some historical discussions around what the future of this class should look like, but the goal of this PR is to introduce equivalent logic that does not use the AB.
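The disagreement in this thread can be illustrated with a small sketch in which a plain adjacency matrix stands in for RandomGraph. When the graph is fully connected, the adjacency-filtered neighbors equal the full peer list; when the degree is limited, they are a strict subset. `NeighborSketch`, its methods, and the `NodeId` record are hypothetical and not taken from the platform code.

```java
import java.util.ArrayList;
import java.util.List;

final class NeighborSketch {
    // Hypothetical stand-in for the platform's NodeId type.
    record NodeId(long id) {}

    /** Returns the nodes adjacent to selfIndex according to the given adjacency matrix. */
    static List<NodeId> neighbors(final List<NodeId> allNodes, final int selfIndex, final boolean[][] adjacent) {
        final List<NodeId> result = new ArrayList<>();
        for (int i = 0; i < allNodes.size(); i++) {
            if (i != selfIndex && adjacent[selfIndex][i]) {
                result.add(allNodes.get(i));
            }
        }
        return result;
    }

    /** Builds a fully connected graph: every pair of distinct nodes is adjacent. */
    static boolean[][] fullyConnected(final int size) {
        final boolean[][] adjacent = new boolean[size][size];
        for (int i = 0; i < size; i++) {
            for (int j = 0; j < size; j++) {
                adjacent[i][j] = i != j;
            }
        }
        return adjacent;
    }
}
```

Returning the whole peer list is therefore equivalent only while the configured number of connections covers every other node, which is the case the existing test exercises according to the comments above.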

kfa-aguda added 6 commits April 24, 2024 16:24

12984 use peers for Network component

f23c2ac

Signed-off-by: Kore Aguda <kore@swirldslabs.com>

12984 use peers for Network component

e27c958

Signed-off-by: Kore Aguda <kore@swirldslabs.com>

Revert "12984 use peers for Network component"

5297e23

This reverts commit f23c2ac. Signed-off-by: Kore Aguda <kore@swirldslabs.com>

12984 use peers for Network component

9793eae

Signed-off-by: Kore Aguda <kore@swirldslabs.com>

12984 use peers for Network component

2d70913

Signed-off-by: Kore Aguda <kore@swirldslabs.com>

12984 use peers for Network component

5368df8

Signed-off-by: Kore Aguda <kore@swirldslabs.com>

kfa-aguda changed the title ~~feat: Eliminate addressBook from Network component~~ feat: Eliminate addressBook from Network components Apr 25, 2024

kfa-aguda marked this pull request as ready for review April 26, 2024 01:02

kfa-aguda requested review from a team as code owners April 26, 2024 01:02

kfa-aguda requested review from edward-swirldslabs and timo0 April 26, 2024 01:02

kfa-aguda self-assigned this Apr 26, 2024

kfa-aguda requested review from cody-littley, alittley and lpetrovic05 April 26, 2024 01:03

cody-littley reviewed Apr 26, 2024


alittley reviewed Apr 26, 2024


cody-littley reviewed Apr 26, 2024


lpetrovic05 reviewed Apr 26, 2024


12984 use peers for Network component

3808eae

Signed-off-by: Kore Aguda <kore@swirldslabs.com>

kfa-aguda added 4 commits April 29, 2024 23:29

Merge branch 'develop' into 12984-no-addressBook

e46101c

Signed-off-by: Kore Aguda <kore@swirldslabs.com>

12984 use peers for Network component

79b929b

Signed-off-by: Kore Aguda <kore@swirldslabs.com>

12984 use peers for Network component

74d668d

Signed-off-by: Kore Aguda <kore@swirldslabs.com>

12984 use peers for Network component

dd516d0

Signed-off-by: Kore Aguda <kore@swirldslabs.com>

edward-swirldslabs requested changes May 3, 2024

