This section covers what availability and reliability mean in distributed systems, how to measure them, and the patterns used to achieve high availability.
A system can be available but unreliable (up but returning wrong data) or reliable but unavailable (down for maintenance). You want both.
| Availability | Downtime/year | Downtime/month |
|---|---|---|
| 99% | 3.65 days | 7.3 hours |
| 99.9% | 8.76 hours | 43.8 minutes |
| 99.99% | 52.6 minutes | 4.4 minutes |
| 99.999% | 5.3 minutes | 26 seconds |
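The figures in the table follow from one formula: allowed downtime = period × (1 − availability). The Python sketch below shows that arithmetic. It assumes a 365-day year and an average month of one twelfth of a year; the names `allowed_downtime`, `YEAR`, and `MONTH` are illustrative and not part of any library.

```python
from datetime import timedelta

# Assumptions: a 365-day year, and an "average" month of 1/12 of that year.
YEAR = timedelta(days=365)
MONTH = YEAR / 12


def allowed_downtime(availability_percent: float, period: timedelta) -> timedelta:
    """Return the maximum downtime within `period` at the given availability."""
    if not 0 <= availability_percent <= 100:
        raise ValueError("availability_percent must be between 0 and 100")
    # The unavailable fraction of the period is (1 - availability).
    return period * (1 - availability_percent / 100)


if __name__ == "__main__":
    for target in (99.0, 99.9, 99.99, 99.999):
        per_year = allowed_downtime(target, YEAR)
        per_month = allowed_downtime(target, MONTH)
        print(f"{target}%: {per_year} per year, {per_month} per month")
```

For example, `allowed_downtime(99.9, YEAR)` comes to roughly 8.76 hours. A different month length (for instance a fixed 30 days) shifts the monthly figures slightly, so it helps to state which convention an availability target uses.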
High availability requires all of these together: