Fix Scaling Flaky Tests #1113

caliskanugur · 2024-01-18T16:57:16Z

Static custom scaling tests have multiple fail points and cause inconsistent failures that make it hard to debug or diagnose the issues:

Wait/watch timeouts on the cluster active state and node roles ready state ¹
Clusters get into an error state and it's not the case for manual, either while deleting or adding the nodes
We have ETCD and CP separated pools example on docs, but static runs add ETCD and CP as a single node, which might be also related to this

Additionally, we need to consider redesigning:

Static tests require configuration input, and as these are static tests, they shouldn't need user inputs
While running the whole suite, the dynamic input test will also run, we need a logic to skip these if the input isn't given like the current provisioning tests.

We need to consider redesigning, enhancing, and possibly overwriting these tests. Especially the static tests where we run them on release tests - pushed the sign-off date at least a full day.

We need to consider redesigning nested wait/watch part both for RKE1 and RKE2/K3s. This should be redundant for most of the test designs. Is it an exceptional usage where we need to block the thread twice? ↩

markusewalker · 2024-03-28T15:26:14Z

Rancher PRs
2.9: rancher/rancher#44946

Shepherd PRs
2.9: rancher/shepherd#134
2.8: rancher/shepherd#141
2.7: rancher/shepherd#142

markusewalker · 2024-04-08T17:25:43Z

Merged.

caliskanugur changed the title ~~Fix Custom Scaling Flaky Tests~~ Fix Scaling Flaky Tests Jan 18, 2024

igomez06 assigned markusewalker Jan 19, 2024

igomez06 added the team/area2 label Jan 19, 2024

rancher deleted a comment from siddhantdange Jan 24, 2024

caliskanugur mentioned this issue Jan 25, 2024

Fix Flaky Tests - Go and Python #1121

Open

6 tasks

markusewalker added this to the Current Release milestone Mar 28, 2024

markusewalker mentioned this issue Mar 28, 2024

[v2.9] Fix flaky nodescaling test cases + specify cluster types rancher/rancher#44946

Merged

This was referenced Apr 8, 2024

[v2.8] Fix flaky nodescaling test cases + specify cluster types rancher/rancher#45039

Merged

[v2.7] Fix flaky nodescaling test cases + specify cluster types rancher/rancher#45040

Merged

markusewalker closed this as completed Apr 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Scaling Flaky Tests #1113

Fix Scaling Flaky Tests #1113

caliskanugur commented Jan 18, 2024 •

edited

Loading

markusewalker commented Mar 28, 2024 •

edited

Loading

markusewalker commented Apr 8, 2024

Fix Scaling Flaky Tests #1113

Fix Scaling Flaky Tests #1113

Comments

caliskanugur commented Jan 18, 2024 • edited Loading

Footnotes

markusewalker commented Mar 28, 2024 • edited Loading

markusewalker commented Apr 8, 2024

caliskanugur commented Jan 18, 2024 •

edited

Loading

markusewalker commented Mar 28, 2024 •

edited

Loading