
Seasoned Devops Engineer with over 10+ years of experience driving operational excellence, service resilience, and scalable infrastructure across complex, mission-critical systems. Proven track record in leading SRE and technical support teams, managing business collaboration platforms, and delivering innovative solutions to enhance availability, reliability, and performance. Expert in SRE best practices, including SLO/SLI definition, observability (AppDynamics, Datadog, Splunk), incident management, and automation of operational toil. Adept at architecting and deploying modern, containerized applications using Docker, Kubernetes, and AWS ECS, and building robust CI/CD pipelines with GitLab, Jenkins, and Terraform. Strong leadership in cross-functional collaboration across development, networking, end-user support, and cloud engineering. Passionate about fostering a culture of technical excellence, building self-service tools, and aligning engineering outcomes with business objectives. Demonstrated ability to manage hybrid environments, modernize legacy systems, and lead global teams in high-availability production environments. Senior Site Reliability Engineer with proven success in reducing incident response times by 40% and optimizing deployment processes through automation. Expert in AWS, Docker, and Kubernetes, contributing to enhanced system reliability.