-
Notifications
You must be signed in to change notification settings - Fork 5
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge branch 'main' of https://github.com/datacenter/aci-monitoring-s…
- Loading branch information
Showing
21 changed files
with
633 additions
and
299 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
# LABDCN-2620: Open Source Monitoring for Cisco ACI - Cisco Live APJC 2024 | ||
|
||
This section contains specific instruction on how to run the LABDCN-2620 Walk In Lab. | ||
This lab runs on a pre-existing Kubernetes cluster and can support up to 30 concurrent students. | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,114 @@ | ||
# Access | ||
|
||
The Demo environment is hosted in a DMZ and ca be accessed with the following credentials: | ||
|
||
https://64.104.255.11/ | ||
|
||
user: `guest` | ||
password: `guest` | ||
|
||
The guest user is able to modify the dashboards and run `Explore` queries however it can't save any of the configuration changes. | ||
|
||
# Exploring the ACI Monitoring Stack | ||
|
||
In this section I am gonna guide you trough the available dashboards and how to use them. | ||
|
||
*Note:* Grafana support building dashboard with data coming from Multiple data source but for the moment, the ACI Monitoring stack does not make use of such capability. | ||
|
||
All the Dashboards are located in the `ACI` Folder in the `Dashboards` section of the UI: | ||
![dashboards](images/dashboards.png) | ||
|
||
|
||
## Prometheus backed Dashboards | ||
|
||
These dashboards are using `Prometheus` as data source meaning the data we are visualizing came from an ACI Managed Object and was translated by the `aci-exporter` | ||
|
||
### ACI Faults | ||
This dashboard is a 1:1 copy of the faults that are present inside ACI. The main advantages copmpared to looking at the faults in the ACI UI are: | ||
- the ability to aggregating Faults from Multiple Fabrics in a single table | ||
- allowing advanced sorting and filtering | ||
|
||
|
||
![faults](images/faults.png) | ||
|
||
By using the `Fabric` drop down menu you can select different Fabrics (or All) and you can use the Colum headers to filter/sort the data: | ||
|
||
<img src=images/fabric-filter.png width="400"> | ||
<img src=images/column-filter.png width="400"> | ||
|
||
This is a good dashboard to understand how Grafana dashboards are built, if you are interested on building your own dashboard you can take a look [here](labs/lab1.md). | ||
|
||
|
||
|
||
### EPG Explore | ||
|
||
The EPG Explore is composed of 2 tables: | ||
- EPG To Interface - VLANs: This table allows the user to map a EPG to a VLAN port on a switch. This table can be filtered by: | ||
- fabric | ||
- tenant | ||
- epg | ||
- V(x)LANs to EPG - Interface: This table allows the user to map a VLAN to an EPG and a port on a switch. This table can be filtered by: | ||
- VLAN | ||
- VXLAN | ||
|
||
*Limitations:* This has not yet been tested with overlapping VLANs | ||
|
||
### EPG Stats | ||
|
||
This dashboard contains the following time series graphs: | ||
|
||
- EPG RX Gbits/s: This show the Received traffic in the EPG | ||
- EPG TX Gbits/s: This show the Transmitted traffic by the EPG | ||
- EPG Drops RX Pkts/s: This show the number of Packet drops in the ingress direction | ||
- EPG Drops TX Pkts/s: This show the number of Packet drops in the egress direction | ||
|
||
These dashboards are built with the same logic as the ACI EPG Stats dashboards, just in Grafana | ||
|
||
### Fabric Capacity | ||
|
||
This dashboard contains the same info as the APIC Fabric Capacity dashboard but allows to plot the resource usage over a time period to better monitor the fabric utilization over time | ||
|
||
### Node Capacity | ||
|
||
This dashboard contains the same info as the APIC FabrNodeic Capacity dashboard but allows to plot the resource usage over a time period to better monitor the fabric utilization over time | ||
|
||
### Node Details | ||
|
||
This dashboard contains the following time series graphs: | ||
|
||
- Node CPU Usage | ||
- Node Memory Usage | ||
- Node Health | ||
|
||
### Nodes Interfaces | ||
|
||
This dashboard contains the following graphs: | ||
|
||
- Node Interface status: This dashboard shows which interface are Up/Down | ||
- Interface RX/TX Usage: This dashboard shows the interface utilization in %, it is sorted by highest usage and will display the top 10 interfaces by usage. | ||
|
||
### Power Usage | ||
|
||
This dashboard display a time series graph of the average power draw per switch | ||
|
||
### Routing Protocols | ||
|
||
This dashboard contains the following graphs: | ||
|
||
- L3 Neighbours: For every BGP or OSPF neighbors we display the Node to Peer IP peering, the routing protocol used the State of the connect etc... | ||
- BGP Advertised/Received Paths: For every BGP peering we display the number of paths received/advertised | ||
- BGP Accepted Paths: Time series graph of **received** BGP prefixes | ||
|
||
### Vlans | ||
|
||
Display the APIC config for VLAN Pools and VMM Custom Trunk Ports in filterable tables. | ||
|
||
## Loki backed Dashboards | ||
|
||
These dashboards are using `Loki` as data source meaning the data we are visualizing came from an ACI Syslog Message | ||
|
||
### Contract Drops Logs | ||
|
||
This dashboard parses the logs received by the switches and extract infos on the Contract Drop Logs. This requires a specific [config](syslog.md) on ACI and is limited to 500 Messages/s per switch | ||
|
||
|
Oops, something went wrong.