Agile + DevOps East 2020 Concurrent Session : Before Disaster Strikes: Training DevOps Engineers for the Worst

Conference archive


Thursday, November 12, 2020 - 11:45am to 12:45pm

Before Disaster Strikes: Training DevOps Engineers for the Worst

Picture this: you are startled awake in the middle of the night by a phone call from your supervisor. An emergency has occurred in production, and the only description is that a heavily trafficked site is down. You rush to a conference call with five of your colleagues to find that everyone has a different assessment about what the problem is and how to fix it. There’s no plan in place for this, and as the DevOps engineer, the decision and responsibility for fixing the problem is yours. There’s only time to try one of these methods; you have minutes, not hours, to find the issue and get the site back up and running before the business is critically impacted. This is a common scenario which is familiar to many DevOps engineers. Although disasters happen often, most companies have no plan or preparation for these situations. Like firefighters or first responders, DevOps engineers need to be prepared for a crisis before it happens, both technically and from a disaster management perspective. My talk will describe how to train DevOps engineers to control and manage a crisis before one strikes.

Brought to you in partnership with:


Alireza Chegini
On Air

Alireza Chegini has more than 20 years’ experience in software development. Starting in the customer support department of an Enterprise Application, later he began developing financial applications with the .NET framework and SQL. Since then, he has been involved in many projects, taken different roles, using different programming languages and various tools that have made him quite experienced in all parts of the SDLC. In recent years, he started implementing DevOps practices. Since 2016, he joined a Fintech company as a Site Reliability Engineer, helping the company to move away from traditional development workflows and embracing DevOps culture. Later, Alireza implemented a fully automated DevOps pipeline for several banking projects. This summer he joined On Air as DevOps manager to help out on live event streaming platforms.