-
Notifications
You must be signed in to change notification settings - Fork 5
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Document upgrade steps from RHOAI 2.10 to 2.11
- Loading branch information
1 parent
59888cb
commit 2b757db
Showing
3 changed files
with
134 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,31 @@ | ||
# Upgrading from RHOAI 2.10 | ||
|
||
These instructions assume you installed and configured RHOAI 2.10 following | ||
the MLBatch [install instructions for RHOAI-v2.10](../setup.RHOAI-v2.10/CLUSTER-SETUP.md). | ||
|
||
Your subscription will have automatically created an unapproved | ||
install plan to upgrade to RHOAI 2.11. | ||
|
||
Before beginning, verify that the expected install plan exists: | ||
```sh | ||
oc get ip -n redhat-ods-operator | ||
``` | ||
Typical output would be: | ||
```sh | ||
NAME CSV APPROVAL APPROVED | ||
install-nqrbp rhods-operator.2.10.0 Manual true | ||
install-st8vh rhods-operator.2.11.0 Manual false | ||
``` | ||
|
||
Assuming the install plan exists you can begin the upgrade process. | ||
|
||
First, update the MLBatch modifications to the default RHOAI configuration maps. | ||
```sh | ||
oc apply -f setup.RHOAI-v2.11/mlbatch-upgrade-configmaps.yaml | ||
``` | ||
|
||
Second, approve the install plan replacing the example plan name below with the actual | ||
value on your cluster: | ||
```sh | ||
oc patch ip -n redhat-ods-operator --type merge --patch '{"spec":{"approved":true}}' install-st8vh | ||
``` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,102 @@ | ||
--- | ||
apiVersion: v1 | ||
kind: ConfigMap | ||
metadata: | ||
name: codeflare-operator-config | ||
namespace: redhat-ods-applications | ||
data: | ||
config.yaml: | | ||
appwrapper: | ||
Config: | ||
manageJobsWithoutQueueName: true | ||
userRBACAdmissionCheck: false | ||
schedulerName: scheduler-plugins-scheduler | ||
defaultQueueName: default-queue | ||
enabled: true | ||
--- | ||
apiVersion: v1 | ||
kind: ConfigMap | ||
metadata: | ||
name: mlbatch-kueue | ||
namespace: redhat-ods-operator | ||
data: | ||
controller_manager_config.yaml: | | ||
apiVersion: config.kueue.x-k8s.io/v1beta1 | ||
kind: Configuration | ||
health: | ||
healthProbeBindAddress: :8081 | ||
metrics: | ||
bindAddress: :8080 | ||
# enableClusterQueueResources: true | ||
webhook: | ||
port: 9443 | ||
leaderElection: | ||
leaderElect: true | ||
resourceName: c1f6bfd2.kueue.x-k8s.io | ||
controller: | ||
groupKindConcurrency: | ||
Job.batch: 5 | ||
Pod: 5 | ||
Workload.kueue.x-k8s.io: 5 | ||
LocalQueue.kueue.x-k8s.io: 1 | ||
ClusterQueue.kueue.x-k8s.io: 1 | ||
ResourceFlavor.kueue.x-k8s.io: 1 | ||
clientConnection: | ||
qps: 50 | ||
burst: 100 | ||
#pprofBindAddress: :8082 | ||
waitForPodsReady: | ||
enable: false | ||
blockAdmission: false | ||
manageJobsWithoutQueueName: true | ||
#internalCertManagement: | ||
# enable: false | ||
# webhookServiceName: "" | ||
# webhookSecretName: "" | ||
integrations: | ||
frameworks: | ||
# - "batch/job" | ||
- "kubeflow.org/mpijob" | ||
- "ray.io/rayjob" | ||
- "ray.io/raycluster" | ||
- "jobset.x-k8s.io/jobset" | ||
- "kubeflow.org/mxjob" | ||
- "kubeflow.org/paddlejob" | ||
- "kubeflow.org/pytorchjob" | ||
- "kubeflow.org/tfjob" | ||
- "kubeflow.org/xgboostjob" | ||
# - "pod" | ||
externalFrameworks: | ||
- "AppWrapper.v1beta2.workload.codeflare.dev" | ||
# podOptions: | ||
# namespaceSelector: | ||
# matchExpressions: | ||
# - key: kubernetes.io/metadata.name | ||
# operator: NotIn | ||
# values: [ kube-system, kueue-system ] | ||
manager_config_patch.yaml: | | ||
apiVersion: apps/v1 | ||
kind: Deployment | ||
metadata: | ||
name: controller-manager | ||
namespace: system | ||
spec: | ||
template: | ||
spec: | ||
priorityClassName: system-node-critical | ||
containers: | ||
- name: manager | ||
image: $(image) | ||
args: | ||
- "--config=/controller_manager_config.yaml" | ||
- "--zap-log-level=2" | ||
- "--feature-gates=LendingLimit=true" | ||
volumeMounts: | ||
- name: manager-config | ||
mountPath: /controller_manager_config.yaml | ||
subPath: controller_manager_config.yaml | ||
volumes: | ||
- name: manager-config | ||
configMap: | ||
name: manager-config | ||
--- |