Would you believe us if we said the more SEVs we have, the more reliable we are? In this talk we’ll explain why we love SEVs at Meta, and how our culture around SEVs has allowed us to build reliable services at scale. We’ll start by exploring research from other industries on how incident culture shapes reliability. We’ll then share how we’ve applied these lessons to our own culture. Along the way we’ll give a peek at our SEV tool, offer some insight into our SEV review process, and describe how we encourage a “culture of SEVs” from the very first day an engineer arrives at Meta.