Summary
Overview
Work History
Education
Skills
Certification
Accomplishments
Projects Handled
Timeline
Generic

Sandeep Sunku

Hyderabad

Summary

Dynamic and results-driven Program Manager with over 14 years of experience in leading large-scale, cloud-native technology programs. Specialized in DevOps transformation, SRE implementation, and agile delivery across global teams. Proven ability to align engineering execution with business goals, drive operational excellence, and deliver scalable, reliable, and secure infrastructure solutions.

Overview

15
15
years of professional experience
3
3
Certifications

Work History

Program Manager, DevOps & SRE

Phenom
09.2021 - Current
  • Spearheaded delivery of multiple microservices-based platforms on AWS and GCP using Kubernetes and Terraform.
  • Designed and implemented SRE practices including SLIs/SLOs, incident response, and postmortems.
  • Led CI/CD modernization using Jenkins and GitLab CI, reducing deployment time by 40%.
  • Established governance frameworks for release management, compliance, and infrastructure cost optimization.
  • Mentored and scaled a high-performing DevOps team; fostered a culture of ownership and continuous improvement.
  • Acted as the primary liaison between engineering, product, and business stakeholders.

Senior DevOps Engineer / Project Lead

Catlight Health/Pramati Technologies/Indmax Solution
10.2010 - 09.2021
  • Delivered cloud-native infrastructure on AWS and Azure using IaC (Terraform, CloudFormation).
  • Managed Kubernetes clusters (EKS/AKS) with auto-scaling, DR, and blue-green deployments.
  • Implemented observability stack (ELK, Prometheus, Grafana), reducing MTTR by 60%.
  • Transitioned legacy monoliths to microservices, doubling release velocity.
  • Collaborated with product owners to define sprint goals and delivery milestones.

Education

MBA - Computer Science

ISBM
Chennai, India
04.2001 -

Skills

Agile/Scrum

Certification

AWS Certified DevOps Engineer – Professional

Accomplishments

  • Delivery Excellence: Reduced average release cycle by 50% and improved predictability.
  • SRE Maturity: Introduced SLOs and error budgets, improving system reliability to 99.95%.
  • Process Innovation: Built a delivery maturity model for CI/CD, SDLC, and observability.
  • Stakeholder Engagement: Enhanced visibility and trust through structured reporting and communication.
  • Team Growth: Mentored 15+ engineers, fostering a collaborative and resilient DevOps culture.

Projects Handled

  • AWS Cloud Migration, Led the end-to-end migration of 100+ applications from on-prem to AWS, managing cross-functional teams across 3 regions. Delivered with zero downtime and 25% cost savings.
  • Security & Vulnerability Remediation, Directed a DevSecOps initiative to patch critical vulnerabilities across 300+ systems. Integrated security tools with CI/CD and ensured full compliance without production impact.
  • RHEL 7 to 8 Upgrade, Oversaw the upgrade of 200+ Linux servers, coordinating across DevOps, infra, and app teams. Automated workflows and ensured zero disruption with full compatibility.
  • Disaster Recovery Program, Directed the design and implementation of a resilient DR network architecture, ensuring alignment with business-defined RTO/RPO objectives and maintaining high availability across critical services.
  • Containerization Strategy, Program-managed the transformation of 500+ microservices from binary-based deployments to Docker containers, enabling standardized and scalable delivery pipelines.
  • Kubernetes Modernization, Led the enterprise-wide migration of 1,000+ microservices to a production-grade Kubernetes platform, driving automation, high availability, and efficient resource utilization.
  • Cloud Infrastructure Optimization, Oversaw the migration of 500+ EC2 instances to a modern VPC architecture, enhancing security, performance, and operational agility through subnet segmentation and NAT gateway integration.
  • Crisis Management & DNS Recovery, Successfully led emergency DNS migration from GoDaddy to AWS Route 53 during a critical outage, restoring services within 3 hours and minimizing business impact.
  • Environment Automation, Spearheaded automation of ITX environment provisioning, reducing lead time from 3 months to 2 weeks through cross-functional collaboration and process streamlining.
  • Cloud Cost Optimization, Directed a cost-efficiency initiative that migrated workloads to AWS Graviton instances, achieving $500K in annual savings while improving performance-to-cost ratios.

Timeline

Program Manager, DevOps & SRE

Phenom
09.2021 - Current

Senior DevOps Engineer / Project Lead

Catlight Health/Pramati Technologies/Indmax Solution
10.2010 - 09.2021

MBA - Computer Science

ISBM
04.2001 -
Sandeep Sunku