worker-node does not get ready after reboot #11425
Labels
kind/bug
Categorizes issue or PR as related to a bug.
lifecycle/stale
Denotes an issue or PR has remained open with no activity and has become stale.
What happened?
In a kubespray-cluster with a single control-plane:
When rebooting a worker-node (without any control-plane-components), the node does not get ready again.
What did you expect to happen?
The worker-node gets ready again after the reboot.
How can we reproduce it (as minimally and precisely as possible)?
Deploy a kubespray-cluster with a single control-plane.
Reboot a worker-node without draining it before.
OS
We run ansible via gitlab-ci with
quay.io/kubespray/kubespray:v2.25.0
, so the versions are:Version of Ansible
Version of Python
Version of Kubespray (commit)
7e0a40725
(which isv2.25.0
)Network plugin used
calico
Full inventory with variables
https://gist.github.com/rdxmb/099f6ebd3979369f059a1efdc18f0ec2
Command used to invoke ansible
Output of ansible run
--- anything is ok here, so I do not post the output ---
Anything else we need to know
For me it seems to be kind of a hen's egg problem:
kubelet cannot connect the apiserver via
localhost:6443
, where thenginx-proxy-node-[n]
should run and route to the kubernetes-apiserver.nginx-proxy-node-[n]
cannot get ready because kubelet is not working correctly ...There is also a backup-file created by kubespray with the correct server-ip included:
Workaround
Now the node gets ready again. 🎉
Just some more information:
The text was updated successfully, but these errors were encountered: