Postmortems/Incident Reports/War Stories from real-world failures

Algolia

AWS

Bungie

Cloudflare

DataSpring

  • Datacenter and tornado
    • Website is in Czech, use a translation service to read it. archive.today link
    • This is a story of how a data center dealt with a tornado, a good reminder to verify your offsite backups, a disaster recovery plan, and conduct disaster recovery dry runs.

Facebook

Fastly

Garmin

GitHub

Google Cloud

Grafana Labs

Independent Stories

Indian Registry for Internet Names and Numbers (IRINN)

Netflix

Salesforce

Slack

Twitter

Verizon