no connection for cached dial! for eks cluster #2677

exinos-git · 2024-04-25T13:53:47Z

Describe the bug
cannot connect to EKS cluster after credentials expire and are refreshed
get "no connection for cached dial!"

To Reproduce
Steps to reproduce the behavior:

connect to an EKS cluster
wait for credentials to expire
login to AWS again to refresh creds
try to connect to EKS cluster with k9s

Historical Documents
1932 9:00AM INF <U+2705> Kubernetes connectivity 1933 9:00AM ERR Fail to load global/context configuration error="the server has asked for the client to provide credentials\nk9s config file "/home/someuser/.config/k9s/config.yaml" load failed:\nAdditional pr 1933 operty fullScreen is not allowed\ncannot connect to context: arn:aws:eks:someregion::cluster/blahblah\nk8s connection failed for context: arn:aws:eks:somregeion::cluster/blahblah" 1934 9:00AM ERR Load cluster resources - No API server connection
1935 9:00AM ERR failed to list contexts error="no connection"
1936 9:00AM WRN Unable to dial discovery API error="no connection to dial"
1937 9:00AM ERR can't connect to cluster error="the server has asked for the client to provide credentials"
1938 9:00AM ERR Load cluster resources - No API server connection
1939 9:00AM WRN Unable to dial discovery API error="no connection to dial"
1940 9:00AM ERR Context switch failed error="no connection to cached dial"
1941 9:00AM ERR no connection to cached dial
1942 9:00AM ERR Context switch failed error="no connection to cached dial"
1943 9:00AM ERR no connection to cached dial
1944 9:00AM ERR Context switch failed error="no connection to cached dial"
1945 9:00AM ERR no connection to cached dial
1946 9:00AM ERR Context switch failed error="no connection to cached dial"
1947 9:00AM ERR no connection to cached dial
1948 9:00AM ERR Context switch failed error="no connection to cached dial"
1949 9:00AM ERR no connection to cached dial
1950 9:00AM ERR Context switch failed error="no connection to cached dial"
1951 9:00AM ERR no connection to cached dial
1952 9:00AM ERR Context switch failed error="no connection to cached dial"
1953 9:00AM ERR no connection to cached dial
1954 9:00AM ERR Context switch failed error="no connection to cached dial"
1955 9:00AM ERR no connection to cached dial
1956 9:00AM ERR Context switch failed error="no connection to cached dial"
1957 9:00AM ERR no connection to cached dial
1958 9:00AM ERR Context switch failed error="no connection to cached dial"
1959 9:00AM ERR no connection to cached dial
1960 9:00AM ERR Context switch failed error="no connection to cached dial"
1961 9:00AM ERR no connection to cached dial
1962 9:00AM ERR Context switch failed error="no connection to cached dial"
1963 9:00AM ERR no connection to cached dial
1964 9:00AM ERR Context switch failed error="no connection to cached dial"
1965 9:00AM ERR no connection to cached dial
1966 9:00AM ERR Context switch failed error="no connection to cached dial"
1967 9:00AM ERR no connection to cached dial
1968 9:00AM ERR Context switch failed error="no connection to cached dial"
1969 9:00AM ERR no connection to cached dial
1970 9:00AM ERR Context switch failed error="no connection to cached dial"
1971 9:00AM ERR no connection to cached dial
1972 9:00AM ERR Context switch failed error="no connection to cached dial"
1973 9:00AM ERR no connection to cached dial
1974 9:00AM ERR Context switch failed error="no connection to cached dial"
1975 9:00AM ERR no connection to cached dial

Expected behavior
it refreshes the connection with new creds

Screenshots

Versions (please complete the following information):

OS: [e.g. WSL2]
K9s: [e.g. v0.32.4]
K8s: [e.g. 1.27.12]

Additional context
the only way i could work around this was by moving mv /home/someuser/.local/share/k9s/clusters /home/someuser/.local/share/k9s/clustersbad

pdfrod · 2024-04-30T18:36:46Z

A few weeks ago I also started to have "no connection for cached dial" errors all of the sudden. I've used k9s for more than a year and never had that problem before. In my case I'm connecting to GKE clusters.

If I try to reach the clusters using kubectl it works perfectly, but for some reason I need to do a lot of retries in k9 before it will let me access the clusters. I tried upgrading to the latest k9s version, but the issue persists.

exinos-git · 2024-04-30T18:40:05Z

@pdfrod did you try the workaround i mention mv /home/$USER/.local/share/k9s/clusters /home/$USER/.local/share/k9s/clustersbad

pdfrod · 2024-04-30T18:47:45Z

Just tried it, but it didn't make any difference for me unfortunately.

cablekevin · 2024-05-01T07:22:09Z

Unfortunately I'm also running into this same issue.

After sourcing my new AWS temp credentials with MFA if i start k9s i have to wait several seconds for the context to be loaded properly and it starts working. However sometimes it doesn't load properly and I'm stuck with: "no connection to cached dial".

Version: v0.32.4
Commit: d3027c8
Date: 2024-03-20T19:16:59Z

olivierlacan · 2024-05-03T22:13:35Z

Having the same issue, in some cases k9s appears to reload itself and somehow the issue resolves itself but I'm not quite sure how to trigger it. I tried switching between clusters or hitting ctrl + r.

I even tried to re-authenticate outside of k9s but the UI eventually seemed to refresh on its own after several seconds. It might be helpful to be able to trigger whatever refresh process seemingly happens in the background manually either when refreshing with ctrl + r or with another command.

eric-gt · 2024-05-07T20:52:56Z

I ran into this problem today with clusters in both EKS and GKE, and here's how I solved it:

rename the current k9s config clusters folder to clustersbad with @exinos-git 's mv command. or delete it. your choice
a. N.B.: if you're on OSX, the default K9s config directory is at ~/Library/Application\ Support/k9s
re-authenticate to your clusters out-of-band and update the kubeconfig
a. for EKS aws eks update-kubeconfig --name {cluster name} --region {cluster region}
b. for GKE gcloud container clusters get-credentials {cluster name} --region {cluster region}
run K9s

After following these three steps, k9s automatically boots into the last context I connected to.

I believe what happened in my case was that I updated the names of my contexts in my ~/.kube/config file directly, instead of renaming them in k9s, and that screwed up the mappings between my kubeconfig contexts and the cluster configurations in k9s

wolffberg · 2024-05-08T10:12:16Z

Most likely a duplicate of #2651

pdfrod · 2024-05-15T09:03:06Z

Most likely a duplicate of #2651

Yes, in my case #2651 was exactly the problem I was having. Setting a current-context fixed the problem for me, although it would be nice to not have to set one, as I have multiple clusters and I prefer to be explicit about the cluster I'm currently using.

syselement · 2024-05-16T07:27:33Z

Most likely a duplicate of #2651

Yes, in my case #2651 was exactly the problem I was having. Setting a current-context fixed the problem for me, although it would be nice to not have to set one, as I have multiple clusters and I prefer to be explicit about the cluster I'm currently using.

+1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

no connection for cached dial! for eks cluster #2677

no connection for cached dial! for eks cluster #2677

exinos-git commented Apr 25, 2024

pdfrod commented Apr 30, 2024

exinos-git commented Apr 30, 2024

pdfrod commented Apr 30, 2024

cablekevin commented May 1, 2024

olivierlacan commented May 3, 2024

eric-gt commented May 7, 2024 •

edited

wolffberg commented May 8, 2024

pdfrod commented May 15, 2024

syselement commented May 16, 2024

no connection for cached dial! for eks cluster #2677

no connection for cached dial! for eks cluster #2677

Comments

exinos-git commented Apr 25, 2024

pdfrod commented Apr 30, 2024

exinos-git commented Apr 30, 2024

pdfrod commented Apr 30, 2024

cablekevin commented May 1, 2024

olivierlacan commented May 3, 2024

eric-gt commented May 7, 2024 • edited

wolffberg commented May 8, 2024

pdfrod commented May 15, 2024

syselement commented May 16, 2024

eric-gt commented May 7, 2024 •

edited