Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Flaky Tests for Ubuntu 20.04 Job (Github Actions) #231

Open
richartkeil opened this issue Jan 16, 2022 · 2 comments
Open

Flaky Tests for Ubuntu 20.04 Job (Github Actions) #231

richartkeil opened this issue Jan 16, 2022 · 2 comments
Labels

Comments

@richartkeil
Copy link
Member

The job running the Containernet tests on an Ubuntu 20.04 host in Github Actions fails regularly. Cause is always that assertions like

self.assertTrue(self.net.pingAll() <= 0.0)

fail, because the ping returns a packet drop of 100.0. However they don't always fail, just enough to have to rerun jobs multiple time to make the tests pass.
This does not occur on the Ubuntu 18.04 host, nor could I reproduce it locally so far.

@ElectricalBoy
Copy link
Contributor

ElectricalBoy commented Mar 21, 2022

Seems like this issue is affecting all of my PRs (#238, #239, #240, #241) :(
What's interesting is that the set of failed tests for each GH action in Ubuntu 20.04 runner seems to vary every run. I think this flakiness is caused by the environment the GH runners are in.

Btw Ansible script containernet is using seems to be unable to detect docker preinstalled in GH runners, and attempts to install docker-ce alongside (which it succeeds). My theory (which is more like a random thought) is that the flakiness is coming from two different docker engines running simultaneously, but I haven't tested this theory (and it still doesn't explain why error is not occurring in 18.04 runner). (ignore this random thought - checking through APT showed that preinstalled Docker on GH action runner is automatically removed anyway)

@ElectricalBoy
Copy link
Contributor

https://github.com/mininet/mininet/blob/aa0176fce6fb718a03474f8719261b07b670d30d/.github/workflows/run-tests.yaml#L35

While checking through upstream Mininet repository I've found this in their test action runner - maybe this has some connection with Containernet's issue?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants