James's Ramblings

Site Reliability Engineering (SRE)

Created: December 28, 2024

Site Reliability Engineering (SRE) is a discipline within software engineering concerned with monitoring and improving the availability and performance of deployed software systems.

The Four Golden Signals

  1. Latency: The time it takes to service a request.

  2. Traffic: A measure of how much demand is being placed on your system.

  3. Errors: The rate of requests that fail.

  4. Saturation: How “full” your service is, measured by how much of its most constrained resources (such as CPU, memory, or I/O) is in use.

Mnemonic: Little Tarantulas Escape Sideways.
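
To make the four signals a bit more concrete, here is a minimal sketch of how they might be computed for a single observation window. The `Request` record, the `golden_signals` function, and the choice of a 95th-percentile latency are all assumptions made for illustration; real monitoring systems collect and aggregate these numbers in their own ways.

```python
import math
from dataclasses import dataclass
from typing import Sequence


@dataclass
class Request:
    """One served request (hypothetical record shape, for illustration only)."""
    duration_ms: float  # time taken to service the request
    ok: bool            # False if the request failed


def golden_signals(requests: Sequence[Request], window_s: float,
                   used: float, capacity: float) -> dict:
    """Summarize one observation window as the four golden signals.

    `used` and `capacity` describe the limiting resource (for example
    GiB of memory in use vs. available); which resource that is depends
    on the service, so the caller picks it.
    """
    if not requests:
        raise ValueError("need at least one request in the window")
    if window_s <= 0 or capacity <= 0:
        raise ValueError("window_s and capacity must be positive")

    durations = sorted(r.duration_ms for r in requests)
    # Nearest-rank 95th percentile: tail latency, which an average can hide.
    rank = math.ceil(0.95 * len(durations))
    failures = sum(1 for r in requests if not r.ok)

    return {
        "latency_p95_ms": durations[rank - 1],       # Latency
        "traffic_rps": len(requests) / window_s,     # Traffic
        "error_rate": failures / len(requests),      # Errors
        "saturation": used / capacity,               # Saturation
    }


if __name__ == "__main__":
    # Ten requests seen over a 5-second window; one of them failed.
    window = [Request(100, True)] * 8 + [Request(200, True), Request(900, False)]
    print(golden_signals(window, window_s=5.0, used=6.0, capacity=8.0))
```

Reporting a high percentile rather than the mean is a design choice here: a handful of very slow requests can disappear in an average but still hurt users.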