Loading video player...
š§ From Alert to Resolved: Self-Healing Azure Platform with SRE Agent Your SRE team is spending nights triaging alerts, stitching together logs from five different tools, and writing the same postmortem for the third time this quarter. The problem isn't their skill ā it's the toil. This session is for platform engineers and SREs who run Azure in production and are ready to move beyond reactive ops. Azure SRE Agent went GA on March 10th, 2026! ā What You'll Learn: ⢠Design a subagent topology for platform reliability with domain experts and task specialists ⢠Wire Azure SRE Agent into your DevOps pipeline ā GitHub/Azure DevOps, PagerDuty/ServiceNow, Azure Monitor ⢠Seed the agent's knowledge base with runbooks, postmortems, and architecture docs ⢠Govern autonomy safely using Read-only, Review, and Autonomous modes ⢠Measure impact: track MTTR reduction, on-call toil hours saved, and incident volume trends š¤ Speaker: ⢠Lee Oommen - Principal Cloud & AI Solution Architect at Microsoft with 18 years of experience specializing in Azure IaaS, Kubernetes, hybrid cloud, and reliability engineering š Day 2 | 45 minutes š Subscribe for more Azure Infrastructure content!