Results-driven Site Reliability Engineer (SRE) with 5+ years of experience ensuring system availability, scalability, and performance across enterprise environments. Proficient in implementing core SRE principles (SLOs, SLIs, error budgets, incident management, RCA, RTO/RPO) and delivering high-impact solutions in Microsoft Azure, Docker, Kubernetes, and AWS. Skilled in CI/CD pipelines (Azure DevOps, GitHub Actions), monitoring/APM tools (Promethenus, Grafana), and automation scripting using Python. Adept at incident management, observability, and collaborating with cross-functional teams in fast-paced, production-critical environments. Quick learner, cloud-agnostic mindset, and passionate about applying SRE principles. Proven ability to optimize infrastructure, minimize downtime, and lead cross-functional teams toward operational excellence.
Key Achievements:
SKILLS
• Microsoft Certified: Azure Fundamentals (AZ-900)
• Microsoft Certified: Azure Administrator Associate (AZ-104)
• Microsoft Certified: DevOps Engineer Expert (AZ-400)
• HashiCorp Certified: Terraform Associate – In Progress