How they SRE — screenshot of github.com

How they SRE

This is a curated collection of publicly available resources detailing how various tech organizations implement Site Reliability Engineering. I find it valuable for understanding diverse SRE practices, tools, and culture.

Visit github.com →

Questions & Answers

What is "How they SRE"?
"How they SRE" is a curated knowledge repository that compiles publicly available resources on Site Reliability Engineering (SRE) best practices, tools, techniques, and culture. It gathers content from engineering blogs, conferences, and meetups of leading technology organizations worldwide.
Who can benefit from the "How they SRE" repository?
This repository is beneficial for SRE professionals, DevOps engineers, platform engineers, and anyone interested in learning how established tech organizations implement and practice SRE principles. It serves as a learning resource for understanding real-world SRE adoption.
How does "How they SRE" differ from other SRE learning resources?
Unlike general SRE guides or individual company blogs, "How they SRE" centrally curates and organizes a wide array of specific SRE practices and case studies from numerous organizations. This provides a consolidated view of diverse SRE implementations rather than theoretical concepts or isolated examples.
When should I refer to the "How they SRE" collection?
You should refer to this collection when seeking practical examples and insights into how different companies handle specific SRE topics like monitoring, incident response, chaos engineering, or SRE team building. It's useful for research, learning, or informing your own SRE strategy development.
What specific SRE topics are covered within this repository?
The repository covers a wide range of SRE topics including monitoring & observability, alerting, incident response & post-mortem, on-call strategies, testing in production, chaos engineering, automation, and platform engineering. It also includes sections on SRE hiring, culture, and DevOps.