Tools for Chaos Engineers
- How To Establish a High Severity Incident Management Program https://www.gremlin.com/how-to-establish-a-high-severity-incident-management-program/
- Banjaxed - Open source incident management tool https://github.com/intercom-archive/banjaxed
- Cyphon - Open source incident management and response platform https://github.com/dunbarcyber/cyphon
- Arcdata - Open source incident management and volunteer scheduling application for Red Cross Disaster Services https://github.com/redcross/arcdata
- Prometheus - Open source monitoring system and time series database https://github.com/prometheus/prometheus
- PromViz - Visualizes the traffic of your cluster from Prometheus data https://github.com/nghialv/promviz
- Availability Calculator - Calculate how much downtime should be permitted in your SLA https://github.com/dastergon/availability-calculator
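
The arithmetic behind an availability calculator is simple enough to sketch. The example below is a minimal illustration, not the code of the linked project: the function name `allowed_downtime` and its parameters are hypothetical, and it assumes the availability target is a percentage applied evenly over a fixed period.

```python
from datetime import timedelta


def allowed_downtime(availability_percent: float, period: timedelta) -> timedelta:
    """Return the downtime budget for an availability target over a period.

    Hypothetical helper for illustration; assumes the target is a
    percentage between 0 and 100.
    """
    if not 0 <= availability_percent <= 100:
        raise ValueError("availability_percent must be between 0 and 100")
    # The fraction of the period in which the service may be unavailable.
    unavailable_fraction = 1 - availability_percent / 100
    return timedelta(seconds=period.total_seconds() * unavailable_fraction)


if __name__ == "__main__":
    # Compare a few common targets over a 30-day period.
    for target in (99.0, 99.9, 99.99):
        budget = allowed_downtime(target, timedelta(days=30))
        minutes = budget.total_seconds() / 60
        print(f"{target}% over 30 days allows about {minutes:.1f} minutes of downtime")
```

For instance, a 99.9% target over 30 days leaves a budget of roughly 43 minutes, which is the kind of figure worth having in hand before agreeing to an SLA.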
Gremlin enables you to run proactive Chaos Experiments to verify that your system can withstand failure. https://gremlin.com