-
-
Notifications
You must be signed in to change notification settings - Fork 147
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Autoscaled nodes not joining cluster #466
Comments
Hi, can you share your full config file (minus the token)? |
|
Hi, this has been reported a couple of times before but I haven't had a chance to try and reproduce the problem yet. Can you share more details on how you have configured the network in Hetzner? The more details the better as they might help me understand where the problem might be. |
Hi, I have the same problem. Is there any known config to work? my network config is same as static workers, but after autoscaling and waiting some minutes, I cannot ssh into the autoscaled worker and it is not joining the cluster.
|
The problem is, the few reports I've come across about these issues all involve some custom commands to tweak the network settings, and that's something I haven't checked yet. So far, with the default network configuration, I haven't been able to recreate any of those problems. |
so it might have to do with "private network only" setup? - because thats my only real difference - using a NAT routing VM for Internet access from inside the cluster. |
I can't be sure because I haven't had a chance to verify this, but that's my suspicion at the moment. |
Same here. Just trying the generic cluster_config.yaml and the following section:
"medium-autoscaled" doesn't get created. Full config:
|
Hey @saashqdev, do you have any pods that are waiting for resources that aren't currently available in the cluster? If not, the autoscaler won't do anything. |
Ah, no I don't. I'll take it out then. Thanks |
You can keep it, so it scales automatically only when actually needed :) |
ok no problem. We still get the medium auto-scaled worker pools not showing tho |
So, as I was saying, the autoscaler will only add new nodes when there are pods waiting to be deployed and the cluster doesn’t have enough resources to handle them. If the cluster already has enough resources or if you’re running workloads without specifying the needed CPU and memory, the autoscaler won’t take any action. |
ok, got it - I can't get over how well this tool works... |
Since my last post to this issue, I exactly followed Vito's Documentation with a new cluster project. Autoscaler works with no issue, as long as you DO NOT USE PRIVATE IP. |
Thanks for confirming that. I need to find some time to test those other scenarios. |
Hello!
We are running Hetzner-k3s version 2.0.8 with the following worker pool config:
The nodes are created in Hetzner after autoscaling is initiated by stressing the cluster but they are not joining the cluster after that. We can ssh into the machines but they don't have for example ssh keys set or anything related to k3s installed. For static nodes the ssh keys are set correctly.
We think this has something to do with the previous cloud init wait problem in issue #379
If we read the code correctly the cloud_init_wait.sh script is not called when creating a autoscaled node?
We are running a private network only cluster. Regarding to this PR #458 our cloud init takes several minutes with both static and autoscaled nodes.
The text was updated successfully, but these errors were encountered: