Site Reliability Engineering
How Google Runs Production Systems
Chris Jones author Niall Richard Murphy author Betsy Beyer author Jennifer Petoff author Christof Leng author David Huska author
Format:Paperback
Publisher:O'Reilly Media
Publishing:31st Oct '26
£63.99
This title is due to be published on 31st October, and will be despatched as soon as possible.

Google pioneered the discipline of Site Reliability Engineering, applying reliability to the entire user journey for consumer, enterprise, and infrastructure systems. In the years since, many organizations have followed suit, guided by the tenets laid out in this practical book. This fully revised edition brings Site Reliability Engineering up-to-date with fresh insights on engineering techniques, organizational processes, and case studies that will help you promote and implement greater reliability throughout the engineering lifecycle.
In this collection of essays and articles, key members of Google's Site Reliability Engineering team explore the company's current SRE practices and explain how they've evolved in the decade since the initial publication. New updates cover the value of reliability, cloud reliability, and the impact of AI. You'll learn the principles and practices that enable Google engineers to make some of the world's largest systems scalable, reliable, and efficient—lessons directly applicable to your organization.
- Train new Site Reliability Engineers based on the latest practices in the field
- Develop engineering organizations that support reliability as a feature
- Build online services that incorporate reliability principles
- Use AI to improve SRE across the organization and optimize critical areas such as automation and incident detection <
ISBN: 9798341607682
Dimensions: unknown
Weight: unknown
600 pages
2nd Revised edition