Raft multi version testing #559

Merged (28 commits into hashicorp:main, Mar 25, 2024)

Conversation

@dhiaayachi dhiaayachi (Contributor) commented Jun 26, 2023

In the past, when introducing features that present a compatibility risk in the raft library, our testing strategy relied on putting the feature behind a feature flag and testing a mix of nodes with the feature activated or deactivated. While this strategy has the benefit of being simple and avoids the need for multi-process tests that run an older release of the raft library against a newer one, it presents a risk: we are relying on the premise that, when the feature flag is off, the raft library still behaves the same way as an older version of raft, and any deviation from that would create a testing gap.

This PR proposes a new class of tests in the raft library to cover multi-version testing. The library introduced here makes it possible to instantiate the new version of raft alongside the previously released version, and to build test scenarios such as upgrade tests (demonstrated in this PR as well).

From an implementation perspective, the test suite is its own module. The module introduces two dependencies on raft:

  • A dependency on the new version, representing the tip of the current branch (using a replace directive pointing at the raft module in the current repo).
  • A dependency on the previously released version, representing the latest released tag (using a replace directive pointing at a submodule pinned to the latest release, currently 1.5.0).
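The two-dependency layout described above can be sketched as a go.mod for the test module. Module paths, the alias module name, and the exact replace targets here are illustrative assumptions, not copied from the PR:

```
module github.com/hashicorp/raft/raft-compat

go 1.20

require (
	github.com/hashicorp/raft v1.5.0
	github.com/hashicorp/raft-previous-version v1.5.0
)

// The "new" version: the tip of the current branch, via the repo root.
replace github.com/hashicorp/raft => ../

// The "old" version: a git submodule pinned to the latest release tag (v1.5.0).
replace github.com/hashicorp/raft-previous-version => ./raft-previous-version
```

The alias module path is needed because Go requires two distinct module paths to import two versions of the same package side by side in one test binary.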

The test library introduces a testcluster package, the glue that makes it possible to create and operate the two versions of raft in the same test.

A useful use case for this type of test is the implementation of raft pre-vote. Using the same strategy, we could write multi-version tests (before and after pre-vote) and verify that raft continues to operate safely when transitioning between the two versions.

TODO:

  • Run tests in CI
  • Automate updating the git submodule after a new release tag is created

@dhiaayachi dhiaayachi requested a review from a team as a code owner June 26, 2023 19:48
@dhiaayachi dhiaayachi requested review from loshz and removed request for a team June 26, 2023 19:48
require.Equal(t, a, leader)

// Remove the server from the cluster configuration
getLeader.GetRaft().(*raftprevious.Raft).RemoveServer(raftprevious.ServerID(rLatest.ID(i)), 0, 0)
Member:

Hmm, I'm in two minds about this step... Consul's default mode for servers is leave_on_terminate = false, which I think is correct: generally, killing a server is due either to an in-place upgrade or a crash. In either case the intent is not to reduce the size of the quorum, so not leaving makes sense.

The confusing part is that our docs have, since forever, actively recommended manually performing a consul leave on the server before stopping it, which I've never really liked for the above reasons. I think the benefit of doing so is that when the leader leaves gracefully, leadership can transfer more smoothly, but we now have Leadership Transfer, which should handle that explicitly without messing with the quorum.

I think there was some discussion recently about how leaving first also increases the chances of leader disruption, because the node leaves but then still interacts with the other raft nodes somehow, but I don't recall the details.

I think it would be a better test if we didn't re-configure raft here, as that is closer to how a straight replacement upgrade (without manual leave steps) would work.

Member:

FWIW, those upgrade docs are being changed, and in newer versions the user can force a leadership transfer when upgrading the leader instead of performing a leave.

Bad things can happen when a server leaves the cluster but will come back. Mainly, after leaving, if raft on the node isn't shut down quickly enough, it will attempt to kick off leader elections, running up the term. The behavior we observed with a recent issue is that if that node runs up the term by X, then when it rejoins it will trigger X leader elections until all other nodes agree on the term. This has the effect of destabilizing the cluster during the upgrade.

Contributor Author:

I think, independently of the upgrade procedure documented in Consul, we should have tests for all of those scenarios, as they may happen during an upgrade for different reasons, and we need to ensure that the cluster ends up stable at the end. I can set up a table test that includes those scenarios.

@mkeeler The issue you are referring to is related to not having raft pre-vote implemented. Actually, in the test there is an extra call to wait for leadership because of that, and I added a comment about it not being necessary once we have pre-vote.

Contributor Author:

Also, that makes me realize that I'm changing the names of the nodes when upgrading, which does not really reflect an upgrade scenario; I will fix that.

store := rLatest.Store(getLeader.GetLocalID())

// Remove and shut down the leader node
fr := getLeader.GetRaft().(*raftprevious.Raft).RemoveServer(raftprevious.ServerID(getLeader.GetLocalID()), 0, 0)
Member:

As above. Not sure if this is the "right" thing to do here 🤔

@dhiaayachi dhiaayachi (Contributor, Author) commented Aug 21, 2023

I added multiple upgrade scenarios as discussed; we now test the following upgrades:

  • When a node leaves the cluster before rejoining with a new version
  • When a node restarts with a new version without leaving the cluster
  • When a node, if it is the leader, performs a leadership transfer before restarting with a new version

I think this covers all the possible scenarios, including what we advise in Consul.

@banks @mkeeler could you please give this another round of review? I rebased the raft pre-vote branch on top of this, as I'm adding some upgrade tests for pre-vote too and would like to get both merged in the near term.

@dhiaayachi dhiaayachi requested a review from a team as a code owner March 19, 2024 14:20
@dhiaayachi dhiaayachi requested review from dekimsey and randyhdev and removed request for a team March 19, 2024 14:20
@mkeeler mkeeler (Member) left a comment:

This seems fine to me, but I wonder if, longer term, it would be easier to maintain a container-based approach to multi-version testing (or, if not containers, then multiple compiled test binaries).

For example, if we had a testing-only CLI that exposed all of the methods on Raft via a gRPC API, then you could run these server-testing CLIs and communicate with any cluster generically through the gRPC API. This would avoid needing submodules, importing two versions of the raft library, etc. It would come with the complexity of exposing a testing-only gRPC API for managing Raft. That API could be relatively small to start with, exposing just the methods the tests need, and only be expanded upon in the future.
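A minimal sketch of what such a testing-only gRPC surface might look like; this is entirely hypothetical (no such API exists in hashicorp/raft), with method names borrowed from a few raft operations the tests exercise:

```
syntax = "proto3";

package rafttest;

// Hypothetical testing-only control API for one raft node, as floated above.
service RaftTestControl {
  rpc AddVoter(AddVoterRequest) returns (FutureResult);
  rpc RemoveServer(RemoveServerRequest) returns (FutureResult);
  rpc LeadershipTransfer(Empty) returns (FutureResult);
  rpc State(Empty) returns (StateReply);
}

message AddVoterRequest {
  string id = 1;
  string address = 2;
  uint64 prev_index = 3;
}

message RemoveServerRequest {
  string id = 1;
  uint64 prev_index = 2;
}

message Empty {}

message FutureResult {
  string error = 1; // empty on success
}

message StateReply {
  string state = 1; // e.g. Follower, Candidate, Leader, Shutdown
  uint64 term = 2;
}
```

Each version under test would ship one such binary, and the test harness would only ever speak this protocol, so no Go types from either raft version would leak into the tests.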

@dhiaayachi dhiaayachi (Contributor, Author) commented:

> This seems fine to me but I wonder if longer term it would be easier to maintain a container based approach to multi-version testing (or if not containers then multiple compiled test binaries)
>
> For example, if we had a testing only CLI which exposed all of the methods on the Raft via a gRPC API, then you could run these server testing CLIs and then communicate with any cluster generically through the gRPC API. This would avoid needing submodules, importing two versions of the raft library etc. It would come with the complexity of exposing a testing only gRPC API for managing Raft. This API could be relatively small to start with though to only expose just the methods that the tests need and only be expanded upon in the future.

@mkeeler I considered an approach similar to the test-container approach we use in Consul. The main reason I decided to go with the approach in this PR is debuggability. Especially in a library like raft, being able to see the full system state (all raft instances and their call stacks, code instrumentation using breakpoints...) is important to understand what's going on while debugging. Also, from a fast-iteration perspective, the approach here is quicker to iterate on while adding features and bug fixes (no need to build a container image...). Add to that the fact that observing the system state through the current API is not always ideal, and, as you mentioned, we would need to write code to package the library into a standalone executable, which could be as involved as the wrappers added here.

That said, this approach comes with some drawbacks related to Go's static typing and the need to convert types between the two versions. I tried to abstract as much of that as I can, but it's still involved and could be limiting. I suggest we try it and see how far this approach gets us; if the limits become too inconvenient, we can switch to a container-based testing solution in the future.

@mkeeler mkeeler (Member) commented Mar 21, 2024

Actually, if you defined the gRPC API, you could potentially do in-memory gRPC to each Raft instance while not having to care about types, and still allow easy single-instance debuggability.

Anyway, that's probably too forward-thinking for now.

One other drawback of the current single-process approach is that it requires updating git submodules to test with other versions, instead of being able to parameterize the test with some other version or versions to test with.

@banks banks (Member) left a comment:

This looks awesome. Thanks Dhia!

@dhiaayachi dhiaayachi merged commit cc2bb08 into hashicorp:main Mar 25, 2024
12 checks passed