Site Reliability Engineering (SRE)
Created: December 28, 2024
Site Reliability Engineering (SRE) is a discipline in the field of Software Engineering that monitors and improves the availability and performance of deployed software systems.
The Four Golden Signals
-
Latency: The time it takes to service a request.
-
Traffic: A measure of how much demand is being placed on your system.
-
Errors: The rate of requests that fail.
-
Saturation: How “full” your service is by usage of the limiting resources.
Acronym: Little Tarantulas Escape Sideways.