The vision of the Azure Production Infrastructure Engineering group is to make it easy for everyone to create, consume, and manage planetary-scale, reliable cloud production services and infrastructure to achieve more.
The Azure Incident Management Team are responsible for driving complex multiservice outages to resolution in a timely and effective manner through coordination of internal Azure service teams and key stakeholders. This requires composure under pressure, broad technical, analytical and problem-solving expertise, ability to confidently collaborate with varied partners, and great written and spoken communication.
We are recruiting an experienced team leader who can help create a culture based on growth mindset and customer obsession. As well as personally driving incidents to swift resolution, you will be responsible for recruiting excellent talent, helping them grow in career and strengthening our world class Incident Management function through hyper effective global team work. Participation in the evolution of the Incident Management discipline across Azure will also be a key expectation. Your work in this role will use cutting edge technologies and industry concepts to directly prevent millions of minutes of downtime for customers worldwide. Your scope may also include the management of complex, multi-team projects that result from incidents and crises you manage.
+ Participate in building a world class Incident Management function for one of the world’s fasted growing cloud platforms.
+ Recruit, mentor and develop excellent team members.
+ Be passionate about cloud, customer focused, and a technologist at heart
+ Effectively coordinate production incidents across multiple organizations within Microsoft
+ Drive continuous swift momentum towards mitigation, asking leading technical questions, offering suggestions around troubleshooting direction
+ Provide excellent incident communication to stakeholders
+ Identify opportunities and take ownership for automation and/or continuous improvement of Incident Management process and best practices
+ Lead and/or participate in Post Incident Review and Problem Management meetings with key stakeholders and service owners to review events and opportunities for ongoing improvement
+ Work within a “Follow the Sun” global shift rota, covering local day-time hours, including holidays and weekends, on a rotational basis
+ Bachelor’s degree or 5 years equivalent work experience
+ 6 years direct team management experience
+ Must be technically literate and be able to articulate technical issues in a meaningful way to both engineers and executive level management
+ The ability to communicate confidently and clearly on conference calls, in meetings and via email, at all levels of the organization is essential
+ Crisis management skills: able to set priorities, pursue multiple threads at the same time, accurately reflect current state and drive towards desired state
+ Ability to maintain calm during stressful situations; demonstrated leadership skills under fast-paced, highly dynamic situations
+ Strong design, scripting, problem solving and debugging skills
+ Strong collaboration skills: working across teams and organizations is necessary to be successful
+ Experience managing complex projects spanning multiple teams and organizations
+ Must be able to participate in a multi-location on-call rotation
+ Candidates must pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
+ Software Engineering/Development/Technical Program Management experience
+ Knowledge of Microsoft Azure, AWS, GCP or similar cloud computing platforms
+ Expertise building, delivering and supporting extensible, high scale service platforms
+ Expertise in debugging and remediating issues in large-scale distributed systems
+ PMP, ITIL, Six Sigma with demonstrated application towards service improvement
+ Confident in collaborating, building trust and respect with people outside of the immediate team
+ Excellent project/program management skills with great attention to detail
+ Demonstrates technical excellence by applying engineering principles to solve complex problems
+ Strong reporting and analytics experience
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form at https://careers.microsoft.com/us/en/accommodationrequest .
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.