

Reliability & MLOps Engineer with expertise in designing and managing high-scale SaaS platforms on Amazon EKS. Skilled in infrastructure automation using Terraform, Python, Ansible, and Go to ensure high availability and security. Proven track record of optimizing CI/CD pipelines and implementing advanced deployment strategies, reducing deployment incidents by 70%. Experienced in applying AIOps techniques with Prometheus, Grafana, and Loki to enhance ML-based noise reduction and decrease Mean Time to Recovery (MTTR).