Engineering
Reliability Leadership
Curated Articles
How we curate
- Example Error Budget Policy
Google provides an example error budget detailing release cadence and outage and escalation policies.
- How to run a blameless postmortem
The Atlassian team discusses whether blameless postmortems are even possible, the value of effective blameless postmortems, best practices for a blameless culture, and an internal success story.
- How To Establish a High Severity Incident Management Program
In this guide, Tammy shares how to establish and measure the success of a high severity incident management program. She goes over common types and examples of SEVs, SEV levels, SLOs/SLAs, the full lifecycle of a SEV, team structure, and more.