# Working with Azure Kubernetes Service Cluster Scaling

Imagine a scenario where you realize that your existing cluster is at capacity, and you need to scale it out by adding more nodes so you can deploy more pods.
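Before scaling anything, it can help to confirm how much headroom the current nodes actually have. A minimal check from the CLI (assuming kubectl is already pointed at your lab cluster; the second command additionally assumes the metrics-server add-on is available):

```
# Show each node's allocatable capacity and how much of it is already requested
# by scheduled pods (the "Allocated resources" section of the node description).
kubectl describe nodes | grep -A 8 "Allocated resources"

# If the metrics-server add-on is installed (an assumption, not covered in this lab),
# this reports live CPU and memory usage per node.
kubectl top nodes
```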

## Scale Application

1. Check the current number of pods running via the Grafana dashboard.
    * Go to the same Grafana dashboard from lab 6 and look at the Pods Running Count section. You will see the total count of pods and the various phases they are in.

2. Check the current number of heroes pods running via the Kubernetes CLI.

    ```
    kubectl get pods | grep heroes

    # You should see something like the following as output (one replica of each pod):
    heroes-api-deploy-1165643395-fwjtm             1/1       Running   0          2d
    heroes-db-deploy-839157328-4656j               1/1       Running   0          2d
    heroes-web-1677855039-8t57k                    1/1       Running   0          2d
    ```
3. Scale out the web application.
    * To simulate a real-world scenario, we are going to scale the web app to handle increased load.

    ```
    # This command creates multiple replicas of the heroes-web pod to simulate additional load on the cluster.
    kubectl scale deploy/heroes-web-deploy --replicas=4
    ```
4. Check the number of pods now running via the Grafana dashboard.

5. Check the number of heroes pods running via kubectl.

    ```
    kubectl get pods | grep heroes

    # You should see something like the following as output (more than one heroes-web pod, some of them in different states):
    NAME                                                              READY     STATUS    RESTARTS   AGE
    heroes-web-3683626428-4m1v4                                       0/1       Pending   0          2m
    heroes-web-3683626428-hcs49                                       1/1       Running   0          4m
    heroes-web-3683626428-z1t1j                                       0/1       Pending   0          2m
    heroes-web-3683626428-zxp2s                                       1/1       Running   0          2m
    ```
6. Check the Pods Running panel in the Grafana dashboard again.
    * As you can see, a number of pods are now in the Pending state, which means they are waiting to be scheduled. In this scenario the cluster is out of capacity, so they cannot be scheduled yet. (To confirm this from the CLI, see the sketch after this list.)
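If you want to confirm from the command line why the new replicas are stuck, the scheduler records the reason as an event on each Pending pod. A minimal sketch, using a pod name taken from the sample output above (substitute one of your own Pending heroes-web pods):

```
# List only the pods that have not been scheduled yet.
kubectl get pods --field-selector=status.phase=Pending

# Inspect one Pending pod; the Events section typically shows a FailedScheduling
# message along the lines of "0/2 nodes are available: 2 Insufficient cpu."
kubectl describe pod heroes-web-3683626428-4m1v4
```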

## Scale K8s Cluster

1. Check the number of nodes currently running.

    ```
    kubectl get nodes

    # You should see something like the following as output (there are two nodes in the cluster):
    NAME                       STATUS    ROLES     AGE       VERSION
    aks-nodepool1-42552728-0   Ready     agent     4h        v1.11.2
    aks-nodepool1-42552728-1   Ready     agent     4h        v1.11.2
    ```
2. Scale out the AKS cluster to accommodate the demand.

    ```
    # Set these values to match yours (the cluster name is the resource group name with the underscores removed).
    RESOURCE_GROUP_NAME=$(az group list | jq -r '.[0].name')
    AKS_CLUSTER_NAME="${RESOURCE_GROUP_NAME//_}"

    az aks scale -g $RESOURCE_GROUP_NAME -n $AKS_CLUSTER_NAME --node-count 4
    ```

    Note: this may take some time, so it is a good time to get some coffee.

3. Check to see if the new nodes are deployed and "Ready".

    ```
    kubectl get nodes

    # You should see something like the following as output (there are now 4 nodes in the cluster):
    NAME                       STATUS    ROLES     AGE       VERSION
    aks-nodepool1-42552728-0   Ready     agent     5h        v1.11.2
    aks-nodepool1-42552728-1   Ready     agent     5h        v1.11.2
    aks-nodepool1-42552728-2   Ready     agent     7m        v1.11.2
    aks-nodepool1-42552728-3   Ready     agent     7m        v1.11.2
    ```
4. Revisit the Grafana dashboard to validate that the cluster scale-out is working.
    * Take a look at the Pods Pending Count again; after a few minutes you should see the number of pending pods going down. (A CLI cross-check is sketched after this list.)
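As an optional CLI cross-check (assuming the same heroes-web deployment used above), you can also see which node each replica was scheduled onto once the new nodes report Ready:

```
# The -o wide output adds a NODE column, so you can watch the previously Pending
# replicas land on the newly added aks-nodepool1-* nodes.
kubectl get pods -o wide | grep heroes-web
```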

You now have additional node capacity in your Azure Kubernetes Service cluster and can provision more pods.
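If you would also like to confirm the new size from the Azure side, here is a small sketch reusing the RESOURCE_GROUP_NAME and AKS_CLUSTER_NAME variables set earlier (the JMESPath query targets the first agent pool, which is an assumption that holds for the single-pool cluster used in these labs):

```
# Query the node count Azure records for the cluster's first agent pool;
# after the scale operation it should report 4.
az aks show -g $RESOURCE_GROUP_NAME -n $AKS_CLUSTER_NAME \
  --query "agentPoolProfiles[0].count" -o tsv
```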