Before Disaster Strikes: Training DevOps Engineers for the Worst
Picture this: you are startled awake in the middle of the night by a phone call from your supervisor. There is an emergency in production, and all you are told is that a heavily trafficked site is down. You rush onto a conference call with five of your colleagues, only to find that each has a different assessment of what the problem is and how to fix it. There’s no plan in place for this, and as the DevOps engineer, the decision and the responsibility for fixing the problem are yours. There’s only time to try one of the proposed fixes; you have minutes, not hours, to find the issue and get the site back up and running before the business is critically impacted. This scenario is familiar to many DevOps engineers. Although disasters happen often, most companies have no plan or preparation for them. Like firefighters or first responders, DevOps engineers need to be prepared for a crisis before it happens, both technically and from a disaster management perspective. My talk will describe how to train DevOps engineers to control and manage a crisis before one strikes.