Gremlin
Gremlin is a chaos engineering platform. I use it to inject failures deliberately and test system resilience, so that weaknesses surface before they cause incidents instead of after.
ChaosKube
ChaosKube periodically kills random pods in your Kubernetes cluster. I use it to check that applications handle arbitrary pod failures gracefully.
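To make the "periodically kill a random pod" idea concrete, here is a minimal Python sketch of that loop. It is an illustration only, not ChaosKube's actual implementation or API: the `Pod` type, `pick_victim`, `run_rounds`, and the default exclusion of `kube-system` are all hypothetical names and choices made for this example, and the `kill` callback stands in for whatever would really delete the pod.

```python
import random
import time
from dataclasses import dataclass
from typing import Callable, FrozenSet, List, Optional, Sequence


@dataclass(frozen=True)
class Pod:
    """Hypothetical stand-in for a Kubernetes pod reference."""
    namespace: str
    name: str


def pick_victim(
    pods: Sequence[Pod],
    rng: random.Random,
    excluded_namespaces: FrozenSet[str] = frozenset({"kube-system"}),
) -> Optional[Pod]:
    """Return one random pod outside the excluded namespaces, or None."""
    candidates = [p for p in pods if p.namespace not in excluded_namespaces]
    return rng.choice(candidates) if candidates else None


def run_rounds(
    pods: Sequence[Pod],
    kill: Callable[[Pod], None],
    rounds: int,
    rng: random.Random,
    interval_seconds: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> List[Pod]:
    """Kill one random eligible pod per round, waiting between rounds.

    The pod list is not shrunk after a kill: in a real cluster a
    controller such as a Deployment would normally start a replacement.
    """
    killed: List[Pod] = []
    for _ in range(rounds):
        victim = pick_victim(pods, rng)
        if victim is None:
            break  # nothing eligible to kill
        kill(victim)  # assumption: the caller supplies the real delete
        killed.append(victim)
        sleep(interval_seconds)
    return killed


if __name__ == "__main__":
    demo_pods = [
        Pod("kube-system", "dns"),
        Pod("default", "web-1"),
        Pod("default", "web-2"),
    ]
    # Dry run: print instead of deleting, and skip the real wait.
    run_rounds(
        demo_pods,
        kill=lambda p: print(f"would kill {p.namespace}/{p.name}"),
        rounds=3,
        rng=random.Random(),
        interval_seconds=0,
        sleep=lambda s: None,
    )
```

Passing `kill` and `sleep` in as parameters keeps the sketch runnable without a cluster and makes a dry run the natural default while you decide which namespaces are safe to target.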