SRE 101: Key Terms & Concepts
If you do work related to site reliability engineering, incident response or even just plain-old DevOps, it’s easy to feel like you are drowning in a sea of IT and dev-related terms and abbreviations. The IT world, in general, is riddled with acronyms and abbreviations (Wikipedia lists hundreds of them) that can be hard for the uninitiated to decipher; but the world of SRE (that’s short for Site Reliability Engineering, FYI) and incident response lean especially heavily on them.
If you’ve ever found yourself struggling to figure out what a SRE or DevOps term stands for, or if you just want to bone up on SRE-related skills and concepts — this guide is for you. In the pages that follow, we walk through more than a dozen common terms and acronyms, then explain what they mean and where they fit as concepts within the SRE landscape.
In this eBook, you’ll learn:
- The meanings of a dozen+ SRE terms, including SLO, SLA, SLI, MTTR, MTBF, MTTA, MTTD and more
- How these terms fit into concepts within the SRE landscape
- How modern SREs use these terms in their day-to-day