Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RHOAIENG-17306, RHOAIENG-17307, RHOAIENG-17308: feat(workbenches): tolerate IPv6 environments in codeserver, jupyterlab and rstudio #827

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

chaturvedi-kna
Copy link

@chaturvedi-kna chaturvedi-kna commented Dec 20, 2024

Tied to a PR against Dashboard

Description

This PR aims to bring support for single stack IPv6 env's.
This change allows Jupyter server to listen on all IP addresses
Note: This is a replacement of PR

Changes include:

  • Updated --ServerApp.ip=0.0.0.0 to --ServerApp.ip="" across all applicable start-notebook.sh files.
  • Verified and confirmed the changes were applied consistently across all Jupyter notebooks.

Modified Files:

  1. jupyter/intel/ml/ubi9-python-3.11/start-notebook.sh
  2. jupyter/intel/ml/ubi9-python-3.9/start-notebook.sh
  3. jupyter/intel/pytorch/ubi9-python-3.11/start-notebook.sh
  4. jupyter/intel/pytorch/ubi9-python-3.9/start-notebook.sh
  5. jupyter/intel/tensorflow/ubi9-python-3.11/start-notebook.sh
  6. jupyter/intel/tensorflow/ubi9-python-3.9/start-notebook.sh
  7. jupyter/minimal/ubi9-python-3.11/start-notebook.sh
  8. jupyter/minimal/ubi9-python-3.9/start-notebook.sh

How Has This Been Tested?

Testing was performed by building and running a custom Jupyter Data Science UBI9 Python 3.9 image derived from the Minimal UBI9 Python 3.9 image.

Environment Details:

  • Platform: OpenShift AI
  • Cluster Type: Single stack IPv6 OpenShift 4.16 cluster

Testing Process:

  1. Built the updated images based on the modified start-notebook.sh scripts.
  2. Deployed and ran the images in the OpenShift AI environment.
  3. Verified successful start-up and functionality of the Jupyter Notebooks in the cluster.

Merge criteria:

  • The commits are squashed in a cohesive manner and have meaningful messages.
  • Testing instructions have been added in the PR body (for PRs involving changes that are not immediately obvious).
  • The developer has manually tested the changes and verified that the changes work.

Copy link
Contributor

openshift-ci bot commented Dec 20, 2024

Hi @chaturvedi-kna. Thanks for your PR.

I'm waiting for a opendatahub-io member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@chaturvedi-kna
Copy link
Author

@jiridanek and @atheo89

While making changes to the code server, I noticed a specific instruction to remove listening on IPv6. Was that intentional.

I have built and deployed code server image manually in IPv6 only env and tested it post modifications works fine.

Let me know if any further adjustments are needed!

@chaturvedi-kna
Copy link
Author

/retest-required

1 similar comment
@chaturvedi-kna
Copy link
Author

/retest-required

@jiridanek
Copy link
Member

@chaturvedi-kna the remaining failures are stubborn infra issues, there's little point in retrying the tests, they won't pass. The OpenShift-CI tests are largely duplicated by GitHub Actions tests, so when GitHub actions all pass, then I consider a PR to be above suspicion.

@jiridanek jiridanek changed the title Fix: Update to support IPv6 environments Fix: Update codeserver and jupyter workbenches to support IPv6 environments Dec 23, 2024
@jiridanek jiridanek changed the title Fix: Update codeserver and jupyter workbenches to support IPv6 environments RHOAIENG-17306, RHOAIENG-17307: feat(workbenches): support IPv6 environments in codeserver and jupyterlab Dec 23, 2024
@jiridanek
Copy link
Member

The GHA failures are known issues, see https://issues.redhat.com/browse/RHOAIENG-16587

@chaturvedi-kna
Copy link
Author

Hi @jiridanek, Thanks for clarifying. I felt something wrong with the infra but thought it may be due to being busy. Anyhow, I guess I missed the required changes for rstudio. I will do the modifications, local testing and update PR tomorrow.

@jiridanek
Copy link
Member

Thanks. All maintainer of this repo (incl. me) are on Christmas holiday this week, so somebody will look at it the next week.

@jiridanek
Copy link
Member

My take is that these changes are harmless for ipv4/v6 dualstack; gotta figure out how to check still work with ipv4 singlestack. Checking that this actually fixed v6 is not all that important for me, my first responsibility should be not to cause a regression in existing (guaranteed) functionality. Ipv6 is for me at this point a nice optional bonus.

@chaturvedi-kna chaturvedi-kna changed the title RHOAIENG-17306, RHOAIENG-17307: feat(workbenches): support IPv6 environments in codeserver and jupyterlab RHOAIENG-17306, RHOAIENG-17307, RHOAIENG-17308: feat(workbenches): support IPv6 environments in codeserver, jupyterlab and rstudio Jan 7, 2025
@chaturvedi-kna
Copy link
Author

I understand the importance of ensuring that IPv4 functionality remains intact and does not regress. I have kept that in mind while making changes. However, I currently do not have access to an IPv4 single-stack environment to perform the necessary tests.

Could you please confirm if the existing tests cover this case? If not, I would appreciate any guidance on how to proceed with testing in an IPv4 single-stack environment.

@jiridanek
Copy link
Member

jiridanek commented Jan 7, 2025

I wanted to look into docker/podman networking for this. Worst case, create the container without any network and then add ipv4 interface by following some of those "hard way" guides, like all these ip netns commands in https://lists.podman.io/archives/list/[email protected]/thread/W6MCYO6RY5YFRTSUDAOEZA7SC2EFXRZE/. Maybe I'd ask chatgpt how to do it.

edit: the last two comments on containers/podman#22359 say that all of ipv4 only, ipv6 only and ipv4v6 dualstack is currently possible with podman, just that the cli options to enable it are somewhat confusing; so this is good news

edit: for testing the image, I'm completely satisfied in trying with podman, it's sufficiently faithful representation of what happens in openshift. Going into https://docs.redhat.com/en/documentation/openshift_container_platform/4.17/html/networking/multiple-networks should not be necessary.

@jiridanek
Copy link
Member

jiridanek commented Jan 13, 2025

Here's what I tried

$ podman run --platform=linux/amd64 --sysctl net.ipv6.conf.all.disable_ipv6=1 -p 8888:8888 --rm -it quay.io/opendatahub/workbench-images:codeserver-ubi9-python-3.11-pr-827
$ curl localhost:8888

And that works.

So the way it is, listening on :: is fine even without ipv6 being enabled; what would be a problem is listening on ::1.

$ python3
>>> import socket

>>> sock = socket.socket()
>>> sock.bind(('::', 0))
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
socket.gaierror: [Errno -9] Address family for hostname not supported
# fail

>>> sock = socket.socket(socket.AF_INET6)
>>> sock.bind(('::1', 0))
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
OSError: [Errno 99] Cannot assign requested address
# fail

>>> sock = socket.socket(socket.AF_INET6)
>>> sock.bind(('::', 0))
# ok

I started writing the ipv6 test in

The IPV6_V6ONLY flag is of some interest https://man7.org/linux/man-pages/man7/ipv6.7.html

Comment on lines +65 to +69
return '::' # Dual-stack
elif supports_ipv6:
return '::'
elif supports_ipv4:
return '0.0.0.0'
Copy link
Member

@jiridanek jiridanek Jan 13, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm inclined to replace all this with return '::', because that should work just everywhere; We have ubi9 images that do support ipv6; and jupyterlab/etc. all use socket.AF_INET6 in effect. But your version feels somewhat more cautious, making fewer assumptions, and therefore safer.

@@ -60,7 +84,7 @@ def _get_cmd(port):
'--server-working-dir=' + os.getenv('HOME'),
'--auth-none=1',
'--www-frame-origin=same',
#'--www-address=0.0.0.0',
'--www-address='+ detect_env(),
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@jiridanek
Copy link
Member

/lgtm

when we get the oc version tests, and the extension for checking ipv4-only, i'd like to get this in; so in time for openshift ai 2.18 hopefully

@jiridanek
Copy link
Member

I've rebased the PR on top of current main, to bring in the recent changes from Andy and myself.

@jiridanek
Copy link
Member

jiridanek commented Jan 21, 2025

./scripts/test_jupyter_with_papermill.sh: line 345: jupyter_ml_notebook_id: unbound variable

https://github.com/opendatahub-io/notebooks/actions/runs/12891528203/job/35943757399#step:37:164

Hmm, I probably broke that in #843, and I did not recheck the full "push" results after one final small change; and since pr checks only build changed images, this was missed.

edit: fixed now

@jiridanek jiridanek force-pushed the clean-fix-ipv6-issue branch from 9ce93ee to df54b86 Compare January 21, 2025 18:20
@openshift-ci openshift-ci bot added size/m and removed size/m labels Jan 21, 2025
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'll see if I can make ocp-ci a bit greener by rerunning a few times
/retest

@jiridanek
Copy link
Member

/override ci/prow/codeserver-notebook-e2e-tests ci/prow/images ci/prow/notebooks-ubi9-e2e-tests ci/prow/rstudio-notebook-e2e-tests

Copy link
Contributor

openshift-ci bot commented Jan 22, 2025

@jiridanek: Overrode contexts on behalf of jiridanek: ci/prow/codeserver-notebook-e2e-tests, ci/prow/images, ci/prow/notebooks-ubi9-e2e-tests, ci/prow/rstudio-notebook-e2e-tests

In response to this:

/override ci/prow/codeserver-notebook-e2e-tests ci/prow/images ci/prow/notebooks-ubi9-e2e-tests ci/prow/rstudio-notebook-e2e-tests

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@jiridanek
Copy link
Member

now I believe sufficient regression testing is in place to ensure we are not breaking the ipv4 and dualstack cases, so I'm seriously considering merging this in

@jiridanek
Copy link
Member

/lgtm
/approve

@openshift-ci openshift-ci bot added the lgtm label Jan 23, 2025
Copy link
Contributor

openshift-ci bot commented Jan 23, 2025

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: jiridanek

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@jiridanek jiridanek force-pushed the clean-fix-ipv6-issue branch from df54b86 to 978bb67 Compare January 23, 2025 09:20
@openshift-ci openshift-ci bot removed the lgtm label Jan 23, 2025
Copy link
Contributor

openshift-ci bot commented Jan 23, 2025

New changes are detected. LGTM label has been removed.

@openshift-ci openshift-ci bot added size/m and removed size/m labels Jan 23, 2025
@jiridanek jiridanek added the lgtm label Jan 23, 2025
@jiridanek jiridanek changed the title RHOAIENG-17306, RHOAIENG-17307, RHOAIENG-17308: feat(workbenches): support IPv6 environments in codeserver, jupyterlab and rstudio RHOAIENG-17306, RHOAIENG-17307, RHOAIENG-17308: feat(workbenches): tolerate IPv6 environments in codeserver, jupyterlab and rstudio Jan 23, 2025
@openshift-ci openshift-ci bot added size/m and removed size/m labels Jan 23, 2025
@jiridanek
Copy link
Member

OCP-CI runs failed with infra issues

level=info msg=Creating infrastructure resources...
level=error
level=error msg=Error: Provider produced inconsistent result after apply
level=error
level=error msg=When applying changes to module.master.google_service_account.master-node-sa,
level=error msg=provider "openshift/local/google" produced an unexpected new value: Root
level=error msg=resource was present, but now absent.
level=error
level=error msg=This is a bug in the provider, which should be reported in the provider's own
level=error msg=issue tracker.

@jiridanek
Copy link
Member

/override ci/prow/codeserver-notebook-e2e-tests ci/prow/rstudio-notebook-e2e-tests ci/prow/notebooks-ubi9-e2e-tests

Copy link
Contributor

openshift-ci bot commented Jan 23, 2025

@jiridanek: Overrode contexts on behalf of jiridanek: ci/prow/codeserver-notebook-e2e-tests, ci/prow/notebooks-ubi9-e2e-tests, ci/prow/rstudio-notebook-e2e-tests

In response to this:

/override ci/prow/codeserver-notebook-e2e-tests ci/prow/rstudio-notebook-e2e-tests ci/prow/notebooks-ubi9-e2e-tests

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

Copy link
Contributor

openshift-ci bot commented Jan 23, 2025

@chaturvedi-kna: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
ci/prow/intel-notebooks-e2e-tests 05fead0 link true /test intel-notebooks-e2e-tests
ci/prow/notebook-jupyter-intel-tf-ubi9-python-3-11-pr-image-mirror 05fead0 link true /test notebook-jupyter-intel-tf-ubi9-python-3-11-pr-image-mirror
ci/prow/images 978bb67 link true /test images

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants