Skip to content

bszeti/openshift-grafana-dashboards

Repository files navigation

Custom Grafana dashboards for OpenShift v3.11

Summary by nodetype

Shows summary metrics for all three nodetypes (master, infra, compute). Different type of nodes run different type of workloads, so it’s useful to monitor resource utilization grouped this way (e.g. How much memory do I have to run application pods?).

Top panels:

  • Nodes
  • Node health
  • Node issues

Repeated panels:

  • CPU node %
  • CPU usage/allocatable %
  • CPU usage/request %
  • CPU request/allocatable %
  • CPU limits/allocatable %
  • CPU graph: Total, Allocatable, Node usage, Usage, Request, Limits
  • Memory node %
  • Memory usage/allocatable %
  • Memory usage/request %
  • Memory request/allocatable %
  • Memory limits/allocatable %
  • Memory graph: Total, Allocatable, Node usage, Usage, Request, Limits
  • Network Total: receive+transmit
  • Network Receive
  • Network Transmit
  • Disk bytes read
  • Disk bytes write
  • Disk IO Utilization
  • Filesystem %
  • Pods per node
  • Pod phase

This panel only works if the standard OpenShift labels are used on the nodes (e.g. node-role.kubernetes.io/compute=true). By default it shows master, infra and compute node groups, but if additional node roles exist, you simply have to add them to the list in $nodetype dasboard variable.

Summary by nodetype Summary by nodetype

Summary by node

The dashboard have two sections:

  • Summary resource utilization including all selected nodes
  • Resource utilization for each selected node on-by-one (repeated row)

The panels are more or less the same as on Summary by nodetype.

Summary by node

Namespaces in cluster

One or multiple projects within the cluster can be selected. Values are calculated for the selected projects. This dashboard is a customized version of the default Kubernetes / Compute Resources / Cluster dashboard in OpenShift v3.11.

Panels:

  • Pod phase
  • Pod restarts
  • CPU Usage
  • CPU Quota: Clinking on namespace name opens the Pods in Namespace dashboard
  • Memory Usage
  • Memory Quota: Clinking on namespace name opens the Pods in Namespace dashboard

Namespaces in cluster

Pods in namespace

One namespace and one or multiple pods within the namespace can be selected. Values are calculated for the selected pods. This dashboard is a customized version of the default Kubernetes / Compute Resources / Namespace (Pods) dashboard in OpenShift v3.11.

Panels:

  • Pod Phase
  • Pod Restarts
  • CPU Usage
  • CPU Quota (Pods)
  • CPU Quota (Container)
  • Memory Usage
  • Memory Quota (Pods)
  • Memory Quota (containers)

Limits and requests are set per container, the pod level metrics are simply the summary of container metrics belonging to the pod. When one container in a pod has limits set, but the other does not, the pod level usage can be more than the limits which looks a bit strange first. Taking a look at the the container level table clarifies what we see on the pod level.

Pods in namespace

Directory structure

  • dashboards: Grafana dashboard that can be manually imported or provisioned from file. They require a Prometheus datasource which should be selected as the $datasource dasboard variable. Tested with Grafana v5.4.3 and v6.3.3.
  • dashboards-with-cluster-select: The same dashboards with an extra $cluster variable. It's useful if Prometheus is collecting matrrics from more than one cluster. See Prometheus Federation.
  • dashboards-cluster-overview: Cluster overview page showing overall health of multiple clusters.
  • scripts: simply run deploy.sh to install a Grafana in the existing openshift-monitoring namespace with the custom dasboards provisioned from a ConfigMap.
  • images: Screenshots for README

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages