
🐛 Reconcile etcd members on control plane scale down #265

Merged

Conversation

@Danil-Grigorev (Contributor) commented Feb 14, 2024

kind/bug

What this PR does / why we need it:
This change establishes connectivity to child cluster etcd members and manages membership during cluster scaling. Specifically, this is required when the cluster etcd leader is removed by the scale-down procedure, as this causes the cluster API server to become unavailable and never come back online.

Therefore the etcd leader needs to be moved to another member just before node deletion is requested, and etcd membership has to be adjusted as well.
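To illustrate the mechanics, here is a minimal sketch using the etcd clientv3 API. The function and variable names are hypothetical (not this PR's actual code), and the client is assumed to already be configured with the rke2-generated certificates:

```go
package etcd

import (
	"context"
	"fmt"

	clientv3 "go.etcd.io/etcd/client/v3"
)

// reconcileMemberRemoval moves the etcd leader off the member that is about
// to be deleted, then removes that member from the cluster.
func reconcileMemberRemoval(ctx context.Context, cli *clientv3.Client, removedNodeName string) error {
	members, err := cli.MemberList(ctx)
	if err != nil {
		return fmt.Errorf("listing etcd members: %w", err)
	}

	var removedID, transfereeID uint64
	for _, m := range members.Members {
		if m.Name == removedNodeName {
			removedID = m.ID
		} else {
			transfereeID = m.ID // any surviving member can take over leadership
		}
	}
	if removedID == 0 {
		return nil // member is already gone, nothing to reconcile
	}

	// Check whether the member being removed currently holds leadership.
	status, err := cli.Status(ctx, cli.Endpoints()[0])
	if err != nil {
		return fmt.Errorf("reading etcd status: %w", err)
	}
	if status.Leader == removedID && transfereeID != 0 {
		// Transfer leadership before node deletion makes the API server
		// unreachable. Note MoveLeader must reach the current leader, so the
		// client should be pointed at its endpoint.
		if _, err := cli.MoveLeader(ctx, transfereeID); err != nil {
			return fmt.Errorf("moving etcd leader: %w", err)
		}
	}

	_, err = cli.MemberRemove(ctx, removedID)
	return err
}
```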

RKE2 follows a different certificate management model as opposed to CAPI, therefore we can’t provide CAPI certificates on node bootstrap, and instead have to fetch the certificate generated by the rke2 server during bootstrapping.

New clusters will use the regular CAPI certificate management model, where the certificates are provided to the RKE2 agent on initialization and are generated by cabprke2 if missing. Therefore, for the time being there will be two co-existing cluster configurations, which will be reduced to the upstream one by performing certificate rotation in the future.
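As a rough sketch of how the interim model could mirror the rke2-generated certificate into the management cluster (the secret names and the helper below are assumptions for illustration, not the provider's actual identifiers):

```go
package secrets

import (
	"context"

	corev1 "k8s.io/api/core/v1"
	apierrors "k8s.io/apimachinery/pkg/api/errors"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"sigs.k8s.io/controller-runtime/pkg/client"
)

// mirrorEtcdCertSecret copies the etcd client certificate generated by the
// rke2 server on the child cluster into a secret on the management cluster,
// so the provider can connect to child cluster etcd members later.
func mirrorEtcdCertSecret(ctx context.Context, childClient, mgmtClient client.Client, clusterName, namespace string) error {
	var childSecret corev1.Secret
	// "rke2-etcd-client-cert" in kube-system is a hypothetical child-side name.
	if err := childClient.Get(ctx, client.ObjectKey{Namespace: "kube-system", Name: "rke2-etcd-client-cert"}, &childSecret); err != nil {
		return err
	}

	mirrored := &corev1.Secret{
		ObjectMeta: metav1.ObjectMeta{
			Namespace: namespace,
			Name:      clusterName + "-etcd", // hypothetical management-side name
		},
		Data: childSecret.Data,
	}
	// Tolerate re-runs of the reconcile loop: an existing mirror is fine.
	if err := mgmtClient.Create(ctx, mirrored); err != nil && !apierrors.IsAlreadyExists(err) {
		return err
	}
	return nil
}
```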

Existing clusters will not be required to scale up in order for the fix to take effect; a regular upgrade will do this, even with the MaxSurge=0 setting. For them, if both the local etcd secret and the child cluster’s bootstrapped etcd secret are missing, no etcd operations will be performed. The first upgraded node, however, will act as the “scale up” that populates the child cluster secret. This way it is guaranteed that every cluster will eventually get the fix.
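A minimal sketch of that guard logic, again with hypothetical names rather than the PR's actual code:

```go
package secrets

import (
	"context"

	corev1 "k8s.io/api/core/v1"
	apierrors "k8s.io/apimachinery/pkg/api/errors"
	"sigs.k8s.io/controller-runtime/pkg/client"
)

// shouldReconcileEtcd reports whether etcd membership operations may run.
// If both the local (management cluster) etcd secret and the child cluster's
// bootstrapped etcd secret are missing, the cluster predates the fix, so etcd
// operations are skipped; the first upgraded node then acts as the "scale up"
// that populates the child cluster secret.
func shouldReconcileEtcd(ctx context.Context, mgmtClient, childClient client.Client, key client.ObjectKey) bool {
	var secret corev1.Secret
	localMissing := apierrors.IsNotFound(mgmtClient.Get(ctx, key, &secret))
	childMissing := apierrors.IsNotFound(childClient.Get(ctx, key, &secret))
	// Non-NotFound errors are treated as "present" in this simplified sketch.
	return !(localMissing && childMissing)
}
```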

Which issue(s) this PR fixes (optional, in fixes #<issue number>(, fixes #<issue_number>, ...) format, will close the issue(s) when PR gets merged):
Fixes #263

Special notes for your reviewer:

Checklist:

  • squashed commits into logical changes
  • includes documentation
  • adds unit tests
  • adds or updates e2e tests

@Danil-Grigorev Danil-Grigorev force-pushed the reconcile-etcd-members-scale-down branch 5 times, most recently from 5334096 to d6bc7ea Compare February 19, 2024 10:15
@furkatgofurov7 furkatgofurov7 added this to the v0.3.0 milestone Feb 19, 2024
@Danil-Grigorev Danil-Grigorev force-pushed the reconcile-etcd-members-scale-down branch from 06c1752 to 5288427 Compare February 19, 2024 15:09
@Danil-Grigorev Danil-Grigorev added the kind/bug Something isn't working label Feb 19, 2024
@Danil-Grigorev Danil-Grigorev changed the title [WIP] Reconcile etcd members on control plane scale down 🐛 Reconcile etcd members on control plane scale down Feb 19, 2024
@Danil-Grigorev (Contributor, Author) commented:

The current implementation passes e2e tests covering a single-node upgrade and a scale-to-1 scenario. This will allow preserving existing clusters, but in order to apply the fix, the cluster control plane replicas will have to be scaled up by one node at some point. I will explore a different approach of passing generated certificates to the rke2 server. This may be closer to upstream, but will require all existing clusters to be re-created, or some etcd migration mechanism (like rke2 certificate rotation to manually supplied certificates, if such a thing is supported).

Ran 2 of 2 Specs in 6350.292 seconds
SUCCESS! -- 2 Passed | 0 Failed | 0 Pending | 0 Skipped
PASS

@Danil-Grigorev Danil-Grigorev added kind/bug Something isn't working and removed kind/bug Something isn't working labels Feb 20, 2024
@Danil-Grigorev Danil-Grigorev force-pushed the reconcile-etcd-members-scale-down branch from 5288427 to 5e6a700 Compare February 20, 2024 15:27
@Danil-Grigorev (Contributor, Author) commented:

@richardcase Presubmit e2e job passed without issues here :) And under 30 minutes.

@Danil-Grigorev Danil-Grigorev force-pushed the reconcile-etcd-members-scale-down branch from 5e6a700 to 0e2909f Compare February 20, 2024 17:08
@alexander-demicev (Member) left a comment:


thanks a lot for taking care of this problem

@salasberryfin previously approved these changes Mar 27, 2024
@Danil-Grigorev (Contributor, Author) commented:

The e2e tests are failing after the rebase and the output is very cryptic. I’m also seeing a bug in the (unchanged) RKE2 code related to the PodFailedReason condition: once it is set, it is no longer updated, causing precondition checks to fail.

@Danil-Grigorev Danil-Grigorev force-pushed the reconcile-etcd-members-scale-down branch 3 times, most recently from d48700a to aaa9725 Compare April 8, 2024 19:34
@Danil-Grigorev Danil-Grigorev force-pushed the reconcile-etcd-members-scale-down branch 12 times, most recently from e6bda45 to 58bb752 Compare April 11, 2024 15:06
@Danil-Grigorev Danil-Grigorev force-pushed the reconcile-etcd-members-scale-down branch from 58bb752 to 04b620d Compare April 12, 2024 10:37
@Danil-Grigorev Danil-Grigorev force-pushed the reconcile-etcd-members-scale-down branch from 04b620d to 1ad956d Compare April 12, 2024 11:21
- Disk pressure fix for kube-vip

Signed-off-by: Danil Grigorev <[email protected]>
@Danil-Grigorev Danil-Grigorev force-pushed the reconcile-etcd-members-scale-down branch 2 times, most recently from 8346abb to 734414e Compare April 12, 2024 14:46
Signed-off-by: Danil Grigorev <[email protected]>
@Danil-Grigorev Danil-Grigorev force-pushed the reconcile-etcd-members-scale-down branch from 734414e to 5289859 Compare April 12, 2024 15:53
@Danil-Grigorev (Contributor, Author) commented:

Tests are green again. There were some issues with the code, but some problems are in the CAPI framework: it appears multiple MachineSets are created per MachineDeployment in the upgrade scenario, and the testing code is unable to distinguish between those.

Kube-vip is not helpful, as its pods are getting evicted due to MemoryPressure on the child cluster nodes, and the default set of tolerations is for some reason ignored there. Tests may sometimes flake because, with no load balancing solution, the RKE2 agent may connect to a non-existing (recently removed) node. Another issue I observed with kube-vip is that leader election is never able to release a lock held by a dead pod on a node where the etcd instance is offline. This is likely a client-go issue.

That being said, the PR is ready to be merged, as the functionality is consistent with the description.

@richardcase (Contributor) left a comment:


Looks ok to me.

@Danil-Grigorev - we should also follow up on the “Kube-vip is not helpful” comment.

@furkatgofurov7 (Contributor) left a comment:


Thanks, LGTM

@Danil-Grigorev Danil-Grigorev merged commit e27bcbd into rancher:main Apr 19, 2024
7 checks passed
@Danil-Grigorev Danil-Grigorev deleted the reconcile-etcd-members-scale-down branch April 19, 2024 10:46
Labels
kind/bug Something isn't working
Development

Successfully merging this pull request may close these issues.

Control plane nodes scale down causes etcd to loose quorum and do not restore
6 participants