

Principal Cloud Platform Engineer and Site Reliability Engineer with 9.7+ years of experience designing, operating, and scaling large-scale, high-availability production systems across Microsoft Azure, Google Cloud Platform, and AWS. Proven expertise in SRE principles (SLO, SLI, SLA), Kubernetes-based platforms, Infrastructure as Code, CI/CD automation, and deep Linux and networking fundamentals.
Strong background in cloud platform engineering, DevOps automation, and reliability ownership, including incident management, root cause analysis, MTTR reduction, and observability. Experienced in building secure, resilient, and compliant cloud architectures using policy-as-code, defense-in-depth security models, and governance frameworks. Demonstrated ability to lead DevOps and SRE initiatives, mentor engineers, and collaborate with cross-functional stakeholders to deliver reliability-first, scalable, and cost-optimized platforms.
Microsoft 365
ChatGPT
DrawIO
Rancher
Cricket
Travel
Technical Blogs
Mentoring and Coaching