Proposal for managing Managed ETCD in k3s provider #97
Conversation
There is an open PR for the RKE2 provider to manage etcd membership, which is relevant to this: rancher/cluster-api-provider-rke2#265
Thanks Richard. Although RKE2 inherits from K3s, they host etcd in different ways. RKE2 is more like upstream Kubernetes and hosts etcd as a static pod, so the PR you mentioned manages etcd the same way the kubeadm control plane provider does. K3s, on the other hand, embeds etcd inside the k3s host process, and k3s itself has controllers that manage the embedded etcd; the k3s folks also suggest not operating on etcd directly. So I'm proposing to leverage the k3s etcd controller to manage etcd. I've created a branch locally that implements the etcd management by following this doc, and it's working fine. @richardcase should we include the implementation code in this PR as well, or put it in a separate PR?
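For illustration only, here is a minimal sketch of what delegating member removal to the k3s etcd controller could look like from the provider side. It assumes the controller watches a removal annotation on Node objects; the annotation key and the helper below are assumptions for this sketch, not the PR's actual implementation, and should be verified against the k3s source.

```go
package main

import (
	"context"
	"encoding/json"
	"fmt"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/types"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

// Assumed annotation key watched by the k3s embedded etcd controller;
// verify against the k3s source before relying on it.
const etcdRemoveAnnotation = "etcd.k3s.cattle.io/remove"

// markNodeForEtcdRemoval annotates a Node so the k3s etcd controller
// (rather than the CAPI provider) removes the corresponding etcd member.
func markNodeForEtcdRemoval(ctx context.Context, client kubernetes.Interface, nodeName string) error {
	patch := map[string]interface{}{
		"metadata": map[string]interface{}{
			"annotations": map[string]string{
				etcdRemoveAnnotation: "true",
			},
		},
	}
	data, err := json.Marshal(patch)
	if err != nil {
		return err
	}
	// Merge-patch the Node so only the annotation is added.
	_, err = client.CoreV1().Nodes().Patch(ctx, nodeName, types.MergePatchType, data, metav1.PatchOptions{})
	return err
}

func main() {
	cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	client := kubernetes.NewForConfigOrDie(cfg)
	// "cp-node-to-delete" is a placeholder node name.
	if err := markNodeForEtcdRemoval(context.Background(), client, "cp-node-to-delete"); err != nil {
		panic(err)
	}
	fmt.Println("node annotated for etcd member removal")
}
```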
Thanks @mogliang. I'd keep this PR for the doc and have a separate PR for the implementation. I will make sure I review the proposal today. And great that you have it working 🎉
I did some more investigation to find out whether the etcd proxy could be replaced by the k3s etcd controller.

Case 1: Monitor etcd state for the scale & remediation preflight checks. For kubeadm CAPI, there are two health checks for monitoring etcd health per etcd node.

Case 2: Remove the etcd member before removing a control-plane node. We need this annotation, otherwise scaling down from 2 nodes to 1 node fails (#96).

Case 3: Reconcile etcd members in the control-plane CR reconcile loop. Kubeadm CAPI iterates over all etcd members and looks for members that do not have a corresponding node; any such member is removed from the etcd member list. We also need to reconcile etcd members to prevent losing quorum when deleting a node, but this is not supported by the k3s etcd controller, so it would require modifying the k3s code (a rough sketch of this reconciliation is shown after this comment).

Conclusion: If we want to remove the etcd proxy and rely on the k3s etcd controller, we need to implement Case 1 check 1 and Case 3 in the k3s code. We need more discussion on whether that change is worth it. For now, we could simply implement Case 2 to fix #96.
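To make Case 3 concrete, below is a minimal sketch, assuming direct etcd access (e.g. via the etcd proxy), of the member reconciliation described above. The endpoint, node names, and exact name matching are illustrative assumptions; k3s member names typically carry a generated suffix, so a real implementation would match by prefix or via node annotations, and would use the cluster's TLS credentials.

```go
package main

import (
	"context"
	"fmt"
	"time"

	clientv3 "go.etcd.io/etcd/client/v3"
)

// reconcileEtcdMembers removes etcd members that no longer have a matching
// control-plane node, which is roughly what the kubeadm control plane
// provider does in its reconcile loop (Case 3 above).
func reconcileEtcdMembers(ctx context.Context, cli *clientv3.Client, nodeNames map[string]bool) error {
	resp, err := cli.MemberList(ctx)
	if err != nil {
		return err
	}
	for _, m := range resp.Members {
		// Skip members that have not started yet (empty name) or that still
		// have a corresponding control-plane node.
		if m.Name == "" || nodeNames[m.Name] {
			continue
		}
		// No node backs this member anymore: remove it so quorum math stays correct.
		if _, err := cli.MemberRemove(ctx, m.ID); err != nil {
			return fmt.Errorf("removing stale etcd member %q: %w", m.Name, err)
		}
	}
	return nil
}

func main() {
	// Illustrative endpoint; a real controller would dial etcd through the
	// proxy pod and configure TLS.
	cli, err := clientv3.New(clientv3.Config{
		Endpoints:   []string{"http://127.0.0.1:2379"},
		DialTimeout: 5 * time.Second,
	})
	if err != nil {
		panic(err)
	}
	defer cli.Close()

	// Placeholder control-plane node names known to the provider.
	nodes := map[string]bool{"cp-0": true, "cp-1": true}
	if err := reconcileEtcdMembers(context.Background(), cli, nodes); err != nil {
		panic(err)
	}
}
```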
Thanks @nasusoba for the detailed investigation. So, let's keep the etcd proxy approach for implementing the etcd feature. We may also need to work with k3s to close the gap, since leveraging the k3s etcd controller is the better way in the long run.
Initially, we discussed possible solutions for managing etcd, and the conclusion was to create an etcd proxy pod and then reuse the kubeadm code to manage etcd (a rough sketch of such a proxy pod follows the reference below).
#75
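For context, here is a minimal sketch of what such an etcd proxy pod could look like, assuming a plain TCP forwarder that exposes the node's embedded etcd client port on the pod network. The namespace, image, command, and port are assumptions for illustration, not the provider's actual manifest.

```go
package main

import (
	"fmt"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"sigs.k8s.io/yaml"
)

// etcdProxyPod builds a proxy pod pinned to a control-plane node. It forwards
// TCP connections from the pod network to the embedded etcd that the local
// k3s process exposes on the host, so kubeadm-style etcd client code can
// reach it. All concrete values below are illustrative assumptions.
func etcdProxyPod(nodeName string) *corev1.Pod {
	return &corev1.Pod{
		ObjectMeta: metav1.ObjectMeta{
			Name:      "etcd-proxy-" + nodeName,
			Namespace: "kube-system",
		},
		Spec: corev1.PodSpec{
			NodeName: nodeName, // pin the proxy to the node whose etcd it fronts
			Containers: []corev1.Container{{
				Name:  "etcd-proxy",
				Image: "alpine/socat", // any small TCP forwarder would do
				// Kubernetes expands $(HOST_IP) from the env var below, so the
				// proxy forwards pod-IP:2379 to the host's etcd client port.
				Command: []string{"socat", "TCP-LISTEN:2379,fork,reuseaddr", "TCP:$(HOST_IP):2379"},
				Env: []corev1.EnvVar{{
					Name: "HOST_IP",
					ValueFrom: &corev1.EnvVarSource{
						FieldRef: &corev1.ObjectFieldSelector{FieldPath: "status.hostIP"},
					},
				}},
				Ports: []corev1.ContainerPort{{ContainerPort: 2379}},
			}},
		},
	}
}

func main() {
	// Print the manifest; a controller would create this Pod via the API instead.
	out, err := yaml.Marshal(etcdProxyPod("cp-0"))
	if err != nil {
		panic(err)
	}
	fmt.Println(string(out))
}
```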
Recently, we discussed this with the k3s maintainers:
k3s-io/k3s#9818
k3s-io/k3s#9841
They mentioned that there is an embedded controller living inside the k3s process that manages the etcd lifecycle, and it also exposes interfaces that let us interact with it. I think this may be a better direction for us. I have drafted the design doc here.
Please help review and comment~