Sticky IPs for StatefulSet #28969

Closed · bprashanth opened this issue Jul 14, 2016 · 77 comments
Labels
area/stateful-apps, kind/feature, lifecycle/rotten, sig/apps, sig/network
Comments

@bprashanth
Contributor

Most of the databases I (https://github.com/kubernetes/kubernetes/tree/master/test/e2e/testing-manifests/petset) and others (#28718 (comment)) have prototyped seem to handle DNS properly, but there are murmurs that some do not (#23828 (comment)) and don't have plans to do so (#28718 (comment)).

I'd still vote for deferring any implementation until we have the end-to-end models fleshed out. One might imagine that databases understand the importance of DNS TTLs.

@chrislovecnm
Contributor

I am checking to see if Cassandra can run fully on DNS. It works for looking up seeds, but I am seeing that IP addresses get bound to a token range.

@chrislovecnm
Contributor

I have confirmed that Cassandra does need this. The Datastax team has toyed with the idea of using DNS, but at this time it is not supported to use pure DNS with Cassandra.

@bprashanth
Contributor Author

Great! Do you have a bug/doc for context?

@chrislovecnm
Contributor

@bprashanth Nope, just email from the Datastax folks. They asked me to file a Jira if I wanted to recommend a change.

@zefciu
Contributor

zefciu commented Jul 18, 2016

I have an idea to implement this feature in calico-containers. I believe that something like sticky IPs is not k8s's responsibility. However, I still need confirmation that this is a requirement for Galera in order to get a blessing for this feature.

@chrislovecnm
Contributor

@zefciu With Calico, would this work with the internal IPs / DNS that PetSet uses? We need a pet to have a sticky IP address. Also, I understand using Calico, but this may want to be self-contained in k8s.

@chrislovecnm
Contributor

chrislovecnm commented Jul 19, 2016

To add more color: the sticky IP address is in the internal private subnet that the cluster and minions use. This is not a sticky public IP.

@zefciu
Contributor

zefciu commented Jul 19, 2016

The solution is to use annotations that, passed through CNI, would make Calico use either dynamic or static IPs. I don't know how we can solve the static-IP logic in k8s itself if it outsources all the work of setting up network interfaces to plugins.

@bprashanth
Contributor Author

The way I would like to solve it is by defining a static-ip-subnet (just like the podCIDR assigned to nodes), from which we'd draw these limited IPs and assign them to pods. If a pod with the limited-edition IP dies, we reprogram the routes so traffic flows to whichever node it lands on, just like we set up routes today to route a podCIDR to a specific node.

The easier way to solve static IPs is through a Service VIP, but I don't like that for a couple of reasons (it occupies iptables space, requires a Service per pod, and won't work across kube-clusters).
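
For illustration, a minimal sketch of that Service-per-pod alternative (all names and the clusterIP value are placeholders, and it assumes the statefulset.kubernetes.io/pod-name label that newer StatefulSet releases add to each pod):

apiVersion: v1
kind: Service
metadata:
  name: cassandra-0              # one Service per pet; name is hypothetical
spec:
  clusterIP: 10.96.100.10        # placeholder fixed VIP from the cluster's service CIDR
  selector:
    statefulset.kubernetes.io/pod-name: cassandra-0   # selects exactly one pod
  ports:
  - port: 9042                   # Cassandra CQL port, as an example
    targetPort: 9042

As noted above, this trades a stable per-pod address for one Service (and its iptables entries) per pod, and the VIP is only meaningful inside the cluster.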

@zefciu
Contributor

zefciu commented Jul 20, 2016

@bprashanth: And how would your solution work with plugins? Would it simply send the desired IP to the plugin, or would it take over some of the plugin's responsibilities?

@bprashanth
Contributor Author

Network plugins are responsible for allocating IPs from a given range today, not across nodes. This range is the podCIDR. Something assigns podCIDRs and sets up routing (whatever that may be, it isn't a plugin yet; it could be a cloud-provider-specific route controller or something like flannel). The plugin will only be responsible for, e.g., creating a veth with the allocated IP and shoving it into the netns. IPAM itself is a plugin within the CNI plugin.
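
As a concrete illustration of that split, here is a minimal CNI network config sketch (the name and subnet are placeholders): the outer bridge plugin only plumbs the interface, while the nested ipam section delegates address allocation to a separate host-local IPAM plugin drawing from the node's podCIDR.

{
  "cniVersion": "0.3.1",
  "name": "example-net",
  "type": "bridge",
  "bridge": "cni0",
  "isGateway": true,
  "ipMasq": true,
  "ipam": {
    "type": "host-local",
    "subnet": "10.244.3.0/24"
  }
}

A sticky-IP design would effectively need an IPAM plugin that can reserve specific addresses for specific pods, plus something cluster-level to keep routing pointed at whichever node the pod lands on.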

@magicwang-cn
Contributor

magicwang-cn commented Aug 2, 2016

Init containers also share the same network with the whole pod, so why not get the sticky IPs in the init containers?

@slaskawi
Contributor

As for JBoss projects based on JGroups (like Infinispan, for example), we probably need to write a new discovery protocol based on DNS (I proposed it on the Infinispan dev mailing list and am waiting for a response). Currently some of us use KUBE_PING (which queries the Kubernetes API and collects containers), but trusting DNS would probably be a much better option.

However, we (the Infinispan team) would be very interested in exposing PetSets to the outside world. Our Hot Rod client can take advantage of topology information and optimize queries. Having public sticky IPs (or anything that lets the client decide to which Pod the request should be forwarded) would be very important for us.
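
For context, a minimal sketch of what such a DNS-based approach relies on (all names are placeholders): a headless governing Service gives every pod of a StatefulSet a stable DNS name that a discovery protocol can resolve instead of querying the Kubernetes API.

apiVersion: v1
kind: Service
metadata:
  name: infinispan             # referenced by the StatefulSet's spec.serviceName
spec:
  clusterIP: None              # headless: DNS resolves directly to pod IPs
  selector:
    app: infinispan
  ports:
  - name: jgroups
    port: 7800

With spec.serviceName: infinispan on the StatefulSet, pod infinispan-0 stays reachable as infinispan-0.infinispan.<namespace>.svc.cluster.local regardless of which IP it currently holds; the sticky-IP request in this issue is about clients that cache or persist the IP itself rather than re-resolving such a name.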

@chrislovecnm
Contributor

@thockin this is the one I meant to comment on. Who is setting the priority of this one? Would a proposal be a good start?

@thockin
Member

thockin commented Oct 10, 2016

A proposal is always a good start

@krmayankk

@chrislovecnm Are you writing this proposal?

@chrislovecnm
Contributor

@krmayankk I haven't had any time ... and frankly I found a workaround-ish solution for Cassandra

@k8s-github-robot

@bprashanth There are no sig labels on this issue. Please add a sig label by:
(1) mentioning a sig: @kubernetes/sig-<team-name>-misc
(2) specifying the label manually: /sig <label>

Note: method (1) will trigger a notification to the team. You can find the team list here.

k8s-github-robot added the needs-sig label on May 31, 2017
@caseydavenport
Member

/sig network

k8s-ci-robot added the sig/network label on Jun 1, 2017
k8s-github-robot removed the needs-sig label on Jun 1, 2017
0xmichalis added the sig/apps label and removed the team/cluster (deprecated) label on Jun 3, 2017
@braedon

braedon commented Aug 20, 2017

Hi @chrislovecnm, we're having problems running Cassandra in StatefulSets due to this. While we're waiting for a solution, could you share your workaround? Thanks!

enisoc changed the title from "sticky ips for petset" to "Sticky IPs for StatefulSet" on Sep 7, 2017
freehan added the kind/feature label and removed the triage/unresolved label on May 16, 2019
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

k8s-ci-robot added the lifecycle/stale label on Aug 14, 2019
@agolomoodysaada

/remove-lifecycle stale

k8s-ci-robot removed the lifecycle/stale label on Aug 15, 2019
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

k8s-ci-robot added the lifecycle/stale label on Nov 13, 2019
@aspyct

aspyct commented Nov 13, 2019

/remove-lifecycle stale

k8s-ci-robot removed the lifecycle/stale label on Nov 13, 2019
@vadalikrishna

@krmayankk I haven't had any time ... and frankly I found a workaround-ish solution for Cassandra

Can you please share your workaround? I am facing a similar issue with Cassandra in a StatefulSet.

@cscetbon

@vadalikrishna @krmayankk any information on the workaround would be much appreciated

@madireddyr

@krmayankk @vadalikrishna @chrislovecnm: any info on how you resolved this issue?

@cscetbon

@brouberol What if two Cassandra pods go down at the same time?

Say pod A is running on k8s node 1, and pod B is running on k8s node 2.
They both go down at the same time.
What's to prevent pod A from starting again on node 2, and therefore getting the IP address that was previously assigned to pod B?
It's definitely a problem when the two pods end up with each other's IPs: Cassandra complains that an existing node already holds the tokens the current node is trying to get, and refuses to start.

That's why, if someone like @chrislovecnm has a workaround, it would be much appreciated.

@allamand Has anyone tested the option from Calico:

annotations:
  cni.projectcalico.org/ipAddrs: '["192.168.0.1"]'

As you know, the issue is that we can't assign different annotations to the individual pods belonging to a StatefulSet.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

k8s-ci-robot added the lifecycle/stale label on Apr 15, 2020
@dpepper

dpepper commented May 8, 2020

Say pod A is running on k8s node 1, and pod B is running on k8s node 2.
They both go down at the same time.
What's to prevent pod A from starting again on node 2, and therefore getting the IP address that was previously assigned to pod B?
It's definitely a problem when the two pods end up with each other's IPs: Cassandra complains that an existing node already holds the tokens the current node is trying to get, and refuses to start.

Did anyone find a solution for this?

@allamand

allamand commented May 8, 2020

You can force pod A to stay on node 1 by using local storage and a persistent volume claim.
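
A minimal sketch of that pinning trick (every name, path, and size is a placeholder): a local PersistentVolume carries required node affinity, so once a StatefulSet pod's claim binds to it, the scheduler can only ever place that pod back on the same node.

apiVersion: v1
kind: PersistentVolume
metadata:
  name: cassandra-data-node1
spec:
  capacity:
    storage: 100Gi
  accessModes: ["ReadWriteOnce"]
  persistentVolumeReclaimPolicy: Retain
  storageClassName: local-storage      # matched by the StatefulSet's volumeClaimTemplates
  local:
    path: /mnt/disks/cassandra         # placeholder path on the node
  nodeAffinity:
    required:
      nodeSelectorTerms:
      - matchExpressions:
        - key: kubernetes.io/hostname
          operator: In
          values: ["node1"]            # pins any pod using this volume to node1

Note that this only pins the pod to a node; whether the pod gets the same IP back still depends on the network plugin's IPAM behavior.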

@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

k8s-ci-robot added the lifecycle/rotten label and removed the lifecycle/stale label on Jun 7, 2020
@fejta-bot

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

@k8s-ci-robot
Contributor

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Workloads automation moved this from Backlog to Done Jul 7, 2020
@arianvp

arianvp commented Aug 31, 2020

Can we reopen this? This is still very relevant for many workloads that currently have issues running on Kubernetes, including audio/video (STUN/TURN, etc.), Redis, and Cassandra.

@arianvp

arianvp commented Aug 31, 2020

If you are using Calico, you could add the annotation from https://docs.tigera.io/networking/use-specific-ip to each pod manually to assign it a fixed IP. However, you'd have to add the annotation to new pods that get created when you scale up, which is a bit annoying.

I also thought it would be racy: the Pod would first be created with a different IP that you then change after the fact, which is not ideal, since some workloads (e.g. Cassandra) get very upset at that, and it's exactly what we're trying to prevent with this issue in the first place. But no, you can only set these fields at pod creation time, unfortunately.

It would be really cool if we could add an annotation to the StatefulSet that Calico interprets and then handles automatically: listen for pods being created and assign each one a fixed IP.
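
A minimal sketch of the manual per-pod approach described above (the IP, image, and names are placeholders); since a StatefulSet stamps the same template annotations onto every replica, each pod currently has to be created with its own value some other way:

apiVersion: v1
kind: Pod
metadata:
  name: cassandra-0
  annotations:
    cni.projectcalico.org/ipAddrs: '["192.168.0.1"]'   # Calico-specific; only honored at creation time
spec:
  containers:
  - name: cassandra
    image: cassandra:3.11        # placeholder image
    ports:
    - containerPort: 9042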

@alitoufighi

Can we have this issue reopened?
