What is Failure Modes?

Failure Modes is a community of software practitioners who like to share and discuss the failure modes they have seen in production systems.

We host meetups in Bangalore, India.

Meetups are invite-only and follow the Chatham House Rule to provide a safe space for sharing failures.

Chatham House Rule: When a meeting, or part thereof, is held under the Chatham House Rule, participants are free to use the information received, but neither the identity nor the affiliation of the speaker(s), nor that of any other participant, may be revealed.

Source - chathamhouse.org

See the meetup page for more details.

Browse this website to explore the collection of incidents and to learn from the literature on how and why software systems fail, and how we can build better systems.

Why Failure Modes?

Running software systems in production is hard, and running distributed systems in production is even harder.

Failure Modes is an effort to curate resources and stories from the community, so that we can learn and get better at running large-scale software in production.

Contributing to Failure Modes

  • Observed an interesting failure mode in your day-to-day job? Saw an interesting post-incident report from a company? Create an issue to add it to our collection.

  • Wrote a blog post about a failure you saw in production? Create an issue with a link to the post to add it to our collection.

  • Saw something interesting about failure modes on the internet? Create an issue and share the link. It can be anything: incident postmortems, blog posts, projects, talks, tweets, research, and more.

Huge thanks to our contributors :bowing_man: :bowing_woman: :tada:

Have suggestions or questions? Reach out to Suraj on Twitter (@electron0zero) or open an issue.