New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
kube-apiserver not given enough time to start #6702
Comments
@alok87: I was wondering why the frisbee was getting bigger, then it hit me. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Update: I increased the ec2 instance size from m3.medium to m5.large and that fixed the issue. I guess the instance was a bit overloaded on startup and that is why |
Thanks, @dobesv - I think you're right here. (I was actually involved in the upstream issue as well, as you can see!) I also think you're right that we should look at changing to a more "modern" instance type; it is technically a behavioural change, but it shouldn't be a breaking change, and I think it is in keeping with the idea that if you don't specify an instance type it means "choose for me", rather than "use t2.medium". |
Fix kubernetes#6702 Parallel to upstream issue #71054
Fix kubernetes#6702 Parallel to upstream issue #71054
1. What
kops
version are you running? The commandkops version
, will displaythis information.
2. What Kubernetes version are you running?
kubectl version
will print theversion if a cluster is running or provide the Kubernetes version specified as
a
kops
flag.3. What cloud provider are you using?
AWS
4. What commands did you run? What is the simplest way to reproduce this issue?
5. What happened after the commands executed?
Masters did not come online and ready reliably after restart or on first start. Sometimes they do and sometimes they don't.
6. What did you expect to happen?
Masters come online without problems.
7. Please provide your cluster manifest. Execute
kops get --name my.example.com -o yaml
to display your cluster manifest.You may want to remove your cluster name and other sensitive information.
https://gist.github.com/dobesv/12c2826b9b658b2ead290eca0c63acdf
8. Please run the commands with most verbose logging by adding the
-v 10
flag.Paste the logs into this report, or in a gist and provide the gist link here.
Excerpt of
kube-apiserver.log
:https://gist.github.com/dobesv/d1897bf839dc095c074cba6612ad246e
9. Anything else do we need to know?
After some poking around, the issue seems to be that
kube-apiserver
takes more than 30 seconds to fully start up, sometimes. ThelivenessProbe
is configured with a 15 seconds delay and 15 seconds timeout. So, thekube-apiserver
is marked for termination just before it actually starts up. It gets a 30 second grace period before it is terminated bykubelet
.I am not quite sure whether it is normal for
kube-apiserver
to take >30s to start up, perhaps there's something wrong with my setup that is slowing that process down unreasonably.However, if I ssh into the server and increase the timeouts on the
livenessProbe
forkube-apiserver
to60
for both instead of15
the master comes online OK.I guess my proposal here is to increase the default
livenessProbe
initialDelay
(maybe to 60 seconds) or make it configurable.See Also
The text was updated successfully, but these errors were encountered: