This repository has been archived by the owner on Sep 4, 2021. It is now read-only.

aws: workaround for systemd #1312 #697


mumoshu
Contributor

@mumoshu mumoshu commented Sep 29, 2016

Building on #682 (comment), this replaces kubelet's dependency on the oneshot service decrypt-tls-assets with an ExecStartPre in the kubelet service, because systemd doesn't seem to restart a failed decrypt-tls-assets service when it is only declared as a dependency.

Also, I've changed where ExecStartPre is added: it now goes in the [Service] section, rather than the [Unit] section as was done in https://gist.github.com/spacepluk/a14f10cfed3756c0f1f079e73cdc6c9a#gistcomment-1885799.
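
A minimal sketch of the arrangement described above, not the PR's actual diff. The script path `/opt/bin/decrypt-tls-assets`, the kubelet command, and the restart settings are assumptions for illustration only.

```ini
# kubelet.service (illustrative sketch; paths and values are assumptions)
[Unit]
Description=Kubernetes kubelet (example)

[Service]
# Run the decryption step as part of every kubelet start attempt.
# If it exits non-zero, this start attempt fails, and the Restart=
# setting below makes systemd retry the whole sequence, including
# this ExecStartPre line.
ExecStartPre=/opt/bin/decrypt-tls-assets
ExecStart=/usr/lib/coreos/kubelet-wrapper
Restart=always
RestartSec=10

[Install]
WantedBy=multi-user.target
```

ExecStartPre= is a [Service] option, so systemd does not apply it as a pre-start command when it is placed under [Unit].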

This is mainly intended to fix #675, which I ran into earlier, but it should fix #682 too.

Although the cluster I brought up via kube-aws with this change is working without any problems,
I have not yet had a chance to reproduce #682 to test this.
Does anyone know a handy way to reproduce it?

cc @cgag @spacepluk

@mumoshu mumoshu changed the title Workaround for systemd #1312 aws: Workaround for systemd #1312 Sep 29, 2016
@mumoshu mumoshu changed the title aws: Workaround for systemd #1312 aws: workaround for systemd #1312 Sep 29, 2016
@dghubble
Member

dghubble commented Oct 19, 2016

I think this takes the systemctl is-active some.service trick/hack too far. It only works if the service itself has restarts; otherwise some of these changes "hope" the dependency is satisfied and fail if it is not. I don't think it's useful for the oneshot, for example. Requires= or Wants= are still the right way to do this.
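
For reference, the is-active pattern under discussion can be sketched roughly as follows. The unit and daemon names are hypothetical, and this is only an illustration of the general trick, not code from this PR.

```ini
# consumer.service (hypothetical unit gated on another service)
[Service]
# Fails this start attempt unless some.service is currently active.
ExecStartPre=/usr/bin/systemctl is-active --quiet some.service
ExecStart=/usr/bin/example-daemon
# Without a Restart= setting, a failed check above is never retried,
# so the gate only "hopes" the dependency is already up.
Restart=always
RestartSec=10
```

One reason the check fits a oneshot poorly: a Type=oneshot unit without RemainAfterExit=yes returns to the inactive state once its command finishes, so `systemctl is-active` reports it as not active even after a successful run.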

The is-active flanneld check was done on clusters which required a remote etcd to come online (via polling) before flannel, and thus kubelet, could start, if I recall correctly. Starting etcd -> flannel -> kubelet should work here, since flannel just depends on the local system's etcd.
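
A rough sketch of the etcd -> flannel -> kubelet ordering expressed with plain dependencies. The unit names (`etcd2.service`, `flanneld.service`) and the drop-in file paths are assumptions; the actual names on a given cluster may differ.

```ini
# /etc/systemd/system/flanneld.service.d/10-etcd.conf (hypothetical drop-in)
[Unit]
# Requires= pulls etcd in; After= makes flannel wait for it to start.
Requires=etcd2.service
After=etcd2.service

# /etc/systemd/system/kubelet.service.d/10-flannel.conf (hypothetical drop-in)
[Unit]
Requires=flanneld.service
After=flanneld.service
```

Requires= and Wants= do not imply ordering on their own, which is why each is paired with After= here.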

@mumoshu
Contributor Author

mumoshu commented Oct 19, 2016

@dghubble Thanks again for your feedback!

I think things are a bit clearer to me now.

To sum up,

@aaronlevy
Contributor

I'm going to close this PR, as the kube-aws work has moved to its own top-level repository here: https://github.com/coreos/kube-aws . If this is still something you would like to be merged, please open a PR under the new repo.

@aaronlevy aaronlevy closed this Nov 15, 2016