You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I wanted to update the machine types of the Azure k8s worker nodes and the Azure loadbalancer node. The workflow failed on InstallVPN on Ansibler with the following error in the ansibler.
2024-03-13T13:02:03Z WRN Retrying command ansible-playbook ../../ansible-playbooks/wireguard.yml -i inventory.ini -f 15 ... (4/5) module=ansibler
2024-03-13T13:07:05Z WRN Error encountered while executing ansible-playbook ../../ansible-playbooks/wireguard.yml -i inventory.ini -f 15 : exit status 2 module=ansibler
2024-03-13T13:07:05Z ERR failed to execute cmd: ansible-playbook ../../ansible-playbooks/wireguard.yml -i inventory.ini -f 15 :
azure-compute-lz9kdeo-1: failed
task: Wait 300 seconds for target connection to become reachable/usable
summary: timed out waiting for ping module test: [Errno None] Unable to connect to port 22 on 4.184.250.149 module=ansibler
2024-03-13T13:07:05Z INF Next retry in 160s... module=ansibler
2024-03-13T13:09:45Z WRN Retrying command ansible-playbook ../../ansible-playbooks/wireguard.yml -i inventory.ini -f 15 ... (5/5) module=ansibler
2024-03-13T13:14:46Z WRN Error encountered while executing ansible-playbook ../../ansible-playbooks/wireguard.yml -i inventory.ini -f 15 : exit status 2 module=ansibler
2024-03-13T13:14:46Z ERR Command ansible-playbook ../../ansible-playbooks/wireguard.yml -i inventory.ini -f 15 was not successful after 5 retries module=ansibler
2024-03-13T13:14:46Z ERR Error encountered while installing VPN error="error while running ansible for services/ansibler/server/clusters/jakub-ine9sp1-8i4gib3 : exit status 2:\n\tazure-compute-lz9kdeo-1: failed\n\ttask: Wait 300 seconds for target connection to become reachable/usable\n\tsummary: timed out waiting for ping module test: [Errno None] Unable to connect to port 22 on 4.184.250.149" cluster=jakub module=ansibler project=default-jakub
Also, the builder pod was restarted after the failed run. To see the error logs I had to use --previous flag.
2024-03-13T12:34:39Z INF Config finished building module=builder project=default-jakub
2024-03-13T12:37:42Z INF Processing cluster cluster=jakub module=builder
2024-03-13T12:37:42Z INF Calling BuildInfrastructure on Terraformer cluster=jakub-ine9sp1 module=builder project=default-jakub
2024-03-13T12:39:29Z INF BuildInfrastructure on Terraformer finished successfully cluster=jakub-ine9sp1 module=builder project=default-jakub
2024-03-13T12:39:29Z INF Calling InstallVPN on Ansibler cluster=jakub-ine9sp1 module=builder project=default-jakub
2024-03-13T13:05:10Z INF Received signal terminated module=builder
2024-03-13T13:05:10Z INF Builder stopped checking for new configs module=builder
2024-03-13T13:05:10Z INF Waiting for already started configs to finish processing module=builder
2024-03-13T13:14:46Z ERR Failed to build cluster error="error in Ansibler for cluster jakub project default-jakub : error while calling InstallVPN on Ansibler: rpc error: code = Unknown desc = error encountered while installing VPN for cluster jakub project default-jakub : error while running ansible for services/ansibler/server/clusters/jakub-ine9sp1-8i4gib3 : exit status 2:\n\tazure-compute-lz9kdeo-1: failed\n\ttask: Wait 300 seconds for target connection to become reachable/usable\n\tsummary: timed out waiting for ping module test: [Errno None] Unable to connect to port 22 on 4.184.250.149" cluster=jakub module=builder
2024-03-13T13:14:46Z ERR Error encountered while processing config error="error in Ansibler for cluster jakub project default-jakub : error while calling InstallVPN on Ansibler: rpc error: code = Unknown desc = error encountered while installing VPN for cluster jakub project default-jakub : error while running ansible for services/ansibler/server/clusters/jakub-ine9sp1-8i4gib3 : exit status 2:\n\tazure-compute-lz9kdeo-1: failed\n\ttask: Wait 300 seconds for target connection to become reachable/usable\n\tsummary: timed out waiting for ping module test: [Errno None] Unable to connect to port 22 on 4.184.250.149" module=builder project=default-jakub
2024-03-13T13:14:46Z INF Stopping Builder : http: Server closed module=builder
Expected Behaviour
Claudie updates the machine types of the nodes in the running cluster without any issues.
When the workflow finishes, replace the machine type for azure-compute with Standard_B4ms and the machine type for azure-lb with Standard_B2s. After the changes, the manifest should look like this.
Current Behaviour
I wanted to update the machine types of the Azure k8s worker nodes and the Azure loadbalancer node. The workflow failed on
InstallVPN on Ansibler
with the following error in theansibler
.Also, the
builder
pod was restarted after the failed run. To see the error logs I had to use--previous
flag.Expected Behaviour
Claudie updates the machine types of the nodes in the running cluster without any issues.
Steps To Reproduce
azure-compute
withStandard_B4ms
and the machine type forazure-lb
withStandard_B2s
. After the changes, the manifest should look like this.Anything else to add
Maybe check if this error appears in other cloud providers.
The text was updated successfully, but these errors were encountered: