A while back, one of my company’s more senior types was wandering around our project area (he likes to wander around project areas and ask some challenging questions) when he stopped at my desk and asked me about Chaos Testing.
Popularized by companies like Netflix and its Simian Army suite of tools, Chaos Testing (or Chaos Engineering) is a technique that deliberately simulates outages in your system so you can see how it reacts. (Crudely, you can think of this as walking around your server room, pulling a random cord, and seeing what happens.)
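To make the "pull a random cord" idea a bit more concrete, here is a minimal toy sketch in Python. It is not how Chaos Monkey or any real tool works; the `Service` class, `inject_failure`, and `run_experiment` are hypothetical names I made up for illustration, and the "system" is just an in-memory model where a service stays up as long as one replica survives.

```python
import random


class Service:
    """A toy service made of interchangeable replicas (hypothetical model)."""

    def __init__(self, name, replicas):
        self.name = name
        # Map replica name -> whether that replica is currently running.
        self.alive = {f"{name}-{i}": True for i in range(replicas)}

    def handle_request(self):
        # Assumption: the service can respond while at least one replica is up.
        return any(self.alive.values())


def inject_failure(services, rng):
    """Take down one randomly chosen running replica; return its name."""
    candidates = [
        (svc, rep) for svc in services for rep, up in svc.alive.items() if up
    ]
    if not candidates:
        return None  # nothing left to break
    svc, rep = rng.choice(candidates)
    svc.alive[rep] = False
    return rep


def run_experiment(services, rounds, seed=0):
    """Repeatedly break something, then check whether every service still responds."""
    rng = random.Random(seed)  # seeded so a run can be reproduced
    report = []
    for _ in range(rounds):
        killed = inject_failure(services, rng)
        healthy = all(svc.handle_request() for svc in services)
        report.append((killed, healthy))
    return report


if __name__ == "__main__":
    services = [Service("web", 3), Service("db", 1)]
    for killed, healthy in run_experiment(services, rounds=3):
        status = "still healthy" if healthy else "DEGRADED"
        print(f"killed {killed}: system {status}")
```

The point of the sketch is the loop in `run_experiment`: break one thing at random, then check whether the system as a whole still answers. In this toy model, the single-replica `db` service is the weak spot that the experiment will eventually expose.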
While I haven’t dug deeply into it, I do have a few resources that might be of use to others. Does anyone have others?
A good general article to get you started:
Another list of resources that might be useful for getting started (I haven’t looked at this one as closely):
Chaos Monkey (the original member of the Simian Army, now maintained as a standalone tool) on GitHub to play around with: