Increase timeout for allow successful data re-balance on VSphere/Azure platforms #9994

am-agrawa · 2024-06-27T14:12:52Z

We saw multiple failures on VSphere IPI for test test_add_capacity_ui with data re-balance issue however the test passes on AWS IPI. Increasing the re-balance timeout should stabilise it.

am-agrawa · 2024-07-09T07:07:54Z

Passed here- https://url.corp.redhat.com/a255f95 and https://url.corp.redhat.com/6c815c1

yitzhak12 · 2024-07-09T12:27:01Z

tests/functional/z_cluster/cluster_expansion/test_add_capacity.py

@@ -112,7 +112,7 @@ def add_capacity_test(ui_flag=False):
        verify_storage_device_class(device_class)
        verify_device_class_in_osd_tree(ct_pod, device_class)

-    check_ceph_health_after_add_capacity(ceph_rebalance_timeout=3600)
+    check_ceph_health_after_add_capacity(ceph_rebalance_timeout=5400)


It's not mandatory, but if the issue is only with vSphere IPI, maybe we can do something like this:

timeout = 3600 if is_vsphere_ipi_cluster() else 5400
check_ceph_health_after_add_capacity(ceph_rebalance_timeout=timeout)

No, data re-balance issue is also seen with VSphere UPI and Azure IPI.

Signed-off-by: am-agrawa <[email protected]>

openshift-ci · 2024-07-10T09:23:15Z

New changes are detected. LGTM label has been removed.

openshift-ci · 2024-07-10T09:36:19Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: am-agrawa, ebenahar, prsurve

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

am-agrawa · 2024-07-10T09:38:44Z

/cherry-pick release-4.15

am-agrawa · 2024-07-10T09:38:48Z

/cherry-pick release-4.14

am-agrawa · 2024-07-10T09:38:52Z

/cherry-pick release-4.13

am-agrawa · 2024-07-10T09:39:05Z

/cherry-pick release-4.12

openshift-cherrypick-robot · 2024-07-10T09:39:33Z

@am-agrawa: #9994 failed to apply on top of branch "release-4.13":

Applying: increase timeout for data re-balance on vsphere ipi
Using index info to reconstruct a base tree...
A	tests/functional/z_cluster/cluster_expansion/test_add_capacity.py
Falling back to patching base and 3-way merge...
Auto-merging tests/manage/z_cluster/cluster_expansion/test_add_capacity.py
CONFLICT (content): Merge conflict in tests/manage/z_cluster/cluster_expansion/test_add_capacity.py
error: Failed to merge in the changes.
hint: Use 'git am --show-current-patch=diff' to see the failed patch
Patch failed at 0001 increase timeout for data re-balance on vsphere ipi
When you have resolved this problem, run "git am --continue".
If you prefer to skip this patch, run "git am --skip" instead.
To restore the original branch and stop patching, run "git am --abort".

In response to this:

/cherry-pick release-4.13

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

openshift-cherrypick-robot · 2024-07-10T09:39:34Z

@am-agrawa: new pull request created: #10067

In response to this:

/cherry-pick release-4.15

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

openshift-cherrypick-robot · 2024-07-10T09:39:34Z

@am-agrawa: new pull request created: #10068

In response to this:

/cherry-pick release-4.14

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

openshift-cherrypick-robot · 2024-07-10T09:39:45Z

@am-agrawa: #9994 failed to apply on top of branch "release-4.12":

Applying: increase timeout for data re-balance on vsphere ipi
Using index info to reconstruct a base tree...
A	tests/functional/z_cluster/cluster_expansion/test_add_capacity.py
Falling back to patching base and 3-way merge...
Auto-merging tests/manage/z_cluster/cluster_expansion/test_add_capacity.py
CONFLICT (content): Merge conflict in tests/manage/z_cluster/cluster_expansion/test_add_capacity.py
error: Failed to merge in the changes.
hint: Use 'git am --show-current-patch=diff' to see the failed patch
Patch failed at 0001 increase timeout for data re-balance on vsphere ipi
When you have resolved this problem, run "git am --continue".
If you prefer to skip this patch, run "git am --skip" instead.
To restore the original branch and stop patching, run "git am --abort".

In response to this:

/cherry-pick release-4.12

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

…e platforms (red-hat-storage#9994) Signed-off-by: am-agrawa <[email protected]>

am-agrawa added the Squad/Brown label Jun 27, 2024

am-agrawa self-assigned this Jun 27, 2024

am-agrawa requested a review from a team as a code owner June 27, 2024 14:12

pull-request-size bot added the size/XS label Jun 27, 2024

am-agrawa added the Verified Mark when PR was verified and log provided label Jul 9, 2024

yitzhak12 reviewed Jul 9, 2024

View reviewed changes

am-agrawa requested a review from a team July 10, 2024 08:59

yitzhak12 previously approved these changes Jul 10, 2024

View reviewed changes

openshift-ci bot assigned yitzhak12 Jul 10, 2024

openshift-ci bot added the lgtm label Jul 10, 2024

ebenahar previously approved these changes Jul 10, 2024

View reviewed changes

openshift-ci bot assigned ebenahar Jul 10, 2024

am-agrawa added 2 commits July 10, 2024 14:51

increase timeout for data re-balance on vsphere ipi

526439c

Signed-off-by: am-agrawa <[email protected]>

rebase

13a1b2e

Signed-off-by: am-agrawa <[email protected]>

am-agrawa dismissed stale reviews from ebenahar and yitzhak12 via 13a1b2e July 10, 2024 09:23

am-agrawa force-pushed the data-rebalance-fix branch from 38d5cef to 13a1b2e Compare July 10, 2024 09:23

openshift-ci bot removed the lgtm label Jul 10, 2024

am-agrawa changed the title ~~Increase timeout for allow successful data re-balance on VSphere IPI~~ Increase timeout for allow successful data re-balance on VSphere/Azure platforms Jul 10, 2024

am-agrawa added the lgtm label Jul 10, 2024

prsurve approved these changes Jul 10, 2024

View reviewed changes

openshift-ci bot assigned prsurve Jul 10, 2024

ebenahar approved these changes Jul 10, 2024

View reviewed changes

ebenahar merged commit 495e7c2 into red-hat-storage:master Jul 10, 2024
5 of 6 checks passed

am-agrawa deleted the data-rebalance-fix branch July 10, 2024 09:39

openshift-cherrypick-robot mentioned this pull request Jul 10, 2024

[release-4.15] Increase timeout for allow successful data re-balance on VSphere/Azure platforms #10067

Merged

openshift-cherrypick-robot mentioned this pull request Jul 10, 2024

[release-4.14] Increase timeout for allow successful data re-balance on VSphere/Azure platforms #10068

Merged

amr1ta pushed a commit to amr1ta/ocs-ci that referenced this pull request Jul 25, 2024

Increase timeout for allow successful data re-balance on VSphere/Azur…

fae27ae

…e platforms (red-hat-storage#9994) Signed-off-by: am-agrawa <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Increase timeout for allow successful data re-balance on VSphere/Azure platforms #9994

Increase timeout for allow successful data re-balance on VSphere/Azure platforms #9994

am-agrawa commented Jun 27, 2024

am-agrawa commented Jul 9, 2024 •

edited

Loading

yitzhak12 Jul 9, 2024

am-agrawa Jul 10, 2024

yitzhak12 Jul 10, 2024

openshift-ci bot commented Jul 10, 2024

openshift-ci bot commented Jul 10, 2024

am-agrawa commented Jul 10, 2024

am-agrawa commented Jul 10, 2024

am-agrawa commented Jul 10, 2024

am-agrawa commented Jul 10, 2024

openshift-cherrypick-robot commented Jul 10, 2024

openshift-cherrypick-robot commented Jul 10, 2024

openshift-cherrypick-robot commented Jul 10, 2024

openshift-cherrypick-robot commented Jul 10, 2024

Increase timeout for allow successful data re-balance on VSphere/Azure platforms #9994

Increase timeout for allow successful data re-balance on VSphere/Azure platforms #9994

Conversation

am-agrawa commented Jun 27, 2024

am-agrawa commented Jul 9, 2024 • edited Loading

yitzhak12 Jul 9, 2024

Choose a reason for hiding this comment

am-agrawa Jul 10, 2024

Choose a reason for hiding this comment

yitzhak12 Jul 10, 2024

Choose a reason for hiding this comment

openshift-ci bot commented Jul 10, 2024

openshift-ci bot commented Jul 10, 2024

am-agrawa commented Jul 10, 2024

am-agrawa commented Jul 10, 2024

am-agrawa commented Jul 10, 2024

am-agrawa commented Jul 10, 2024

openshift-cherrypick-robot commented Jul 10, 2024

openshift-cherrypick-robot commented Jul 10, 2024

openshift-cherrypick-robot commented Jul 10, 2024

openshift-cherrypick-robot commented Jul 10, 2024

am-agrawa commented Jul 9, 2024 •

edited

Loading