Accomplished Senior Site Reliability Engineer with 8+ years of expertise in architecting, migrating, and optimizing large-scale data platforms on Kubernetes (AWS EKS). Proven leader in spearheading organizational Kubernetes adoption, automating application onboarding processes, and migrating critical services from SaaS(e.g. Airflow, Spark, JupyterHub) to cost-effective self-managed infrastructure. Expert in engineering zero-downtime deployment strategies, building robust CI/CD pipelines (ArgoCD, Helm, Jenkins), and implementing comprehensive observability solutions. Adept at developing custom automation tools for resource optimization and initiating governance frameworks for streaming platforms like kafka. Experienced in scaling DevOps teams and mentoring engineers.