Senior DevOps Engineer with 12+ years of hands-on experience building and operating large-scale, productiongrade platforms on AWS. Deep expertise in Terraform-driven Infrastructure as Code, Kubernetes (EKS), ECS, and CI/CD automation, with proven success in designing highly available, secure, and cost-optimized cloud architectures. Strong individual contributor known for owning end-to-end infrastructure, driving zero-downtime deployments, improving system reliability, and solving complex production issues in enterprise environments.
Overview
12
12
years of professional experience
Work History
Vice President
Goldman Sachs Pvt Ltd
06.2022 - Current
Architected a high-resiliency, multi-region AWS platform for a transaction banking application, designed to meet strict availability, durability, and fault-tolerance requirements expected in regulated financial systems.
Designed a regionally isolated architecture spanning US East and US West, ensuring application continuity in the event of AZ-level or region-level failures, while maintaining consistent transaction processing.
Implemented active-active / active-passive resilience patterns at the application and infrastructure layers to support disaster recovery, controlled failover, and operational stability.
Engineered a containerized compute layer using Amazon ECS, deployed across multiple Availability Zones, with load balancing and autoscaling to handle variable transaction volumes without service disruption.
Designed a highly available data layer using Amazon RDS Multi-AZ, ensuring synchronous replication, automated failover, and strong data durability aligned with banking SLAs.
Developed zero-downtime deployment strategies for ECS-based services, enabling seamless releases, configuration changes, and infrastructure updates without impacting live transaction flows.
Automated the entire infrastructure stack using Terraform, including VPC design, subnet isolation, routing, security groups, IAM roles, ECS services, RDS, and supporting AWS services, enabling repeatable, auditable, and compliant deployments.
Created modular and reusable Terraform patterns to enforce standardized architecture, security baselines, and environment consistency across production and non-production accounts.
Embedded resilience-first design principles such as AZ isolation, stateless service design, health checks, and automated recovery to minimize blast radius during failures.
Partnered with application and risk teams to align infrastructure design with RTO/RPO objectives, transaction throughput expectations, and regulatory compliance standards.
Actively supported production readiness and incident scenarios, ensuring rapid recovery, controlled failover, and minimal impact to critical banking operations.
AWS, IAC Terraform, RDS, Python, EKS and automation
Lead Engineer
Lululemon Athletic Canada
03.2022 - 06.2022
Lead DevOps Engineer supporting a large-scale global e-commerce platform, with primary ownership of AWS EKS-based Kubernetes infrastructure, GitOps deployments, and production observability.
Led end-to-end Kubernetes (EKS) platform engineering, managing highly available clusters for customer-facing e-commerce applications.
Implemented GitOps deployment model using Flux CD, enabling automated, auditable, and consistent application rollouts across environments.
Designed and maintained Helm charts for microservices, enforcing standardized deployment patterns, configuration management, and environment parity.
Built and operated CI/CD pipelines integrating GitHub / Jenkins with Flux CD for seamless continuous delivery to EKS.
Implemented blue-green and rolling deployment strategies on Kubernetes to ensure zero-downtime releases during peak retail traffic.
Set up cluster-level and application-level monitoring using Prometheus and Grafana, improving system visibility and reducing MTTR.
Defined and enforced Kubernetes best practices including resource limits, HPA, pod security policies, and namespace isolation.
Integrated AWS-native services such as ALB Ingress Controller, IAM Roles for Service Accounts (IRSA), and CloudWatch for secure and scalable operations.
Senior DevOps/Cloud Engineer responsible for end-to-end AWS platform setup and Kubernetes (EKS) cluster design for banking-grade, security-critical applications.
Designed and implemented complete AWS landing zone architecture including VPCs, subnets, routing, NAT, security groups, and IAM following banking security standards.
Led EKS cluster architecture and setup for enterprise banking applications with focus on high availability, fault tolerance, and regulatory compliance.
Designed multi-AZ EKS clusters with node group strategies, autoscaling, and workload isolation for critical banking services.
Implemented Helm-based deployment frameworks to standardize application onboarding and configuration management on EKS.
Integrated IAM Roles for Service Accounts (IRSA) to enforce fine-grained access control between Kubernetes workloads and AWS services.
Implemented secure networking patterns including private clusters, internal load balancers, and restricted ingress/egress controls.
Set up observability for Kubernetes workloads using Prometheus and Grafana, enabling proactive monitoring and alerting.
Automated infrastructure provisioning using Terraform, ensuring repeatable, auditable deployments across environments.
Collaborated closely with banking clients, security, and compliance teams to meet regulatory, audit, and risk requirements.
Supported production workloads with incident response, root cause analysis, and platform hardening.
Cloud Engineer supporting Finacle (core banking) as a SaaS offering, responsible for deploying and operating mission-critical banking applications on AWS and Kubernetes (EKS) for major banking clients.
Deployed and operated Finacle core banking platform as a SaaS solution on AWS cloud, supporting multiple enterprise banking clients.
Designed and implemented Kubernetes (EKS)-based deployment architecture for Finacle components, ensuring scalability, resilience, and high availability.
Led application deployment and lifecycle management on EKS, including configuration, upgrades, patching, and version rollouts.
Built and managed Helm charts to standardize Finacle deployments across environments and client instances.
Implemented secure AWS infrastructure including VPC design, IAM policies, security groups, and private networking aligned with banking compliance requirements.
Integrated monitoring and alerting for mission-critical workloads using Prometheus, Grafana, and CloudWatch.
Provided end-to-end production support for 24x7 banking systems, including incident management, root cause analysis, and performance tuning.
Collaborated directly with banking clients, application teams, and product owners to ensure SLA adherence and production stability.
Supported high-availability and disaster recovery setups for core banking workloads.
Ensured platform reliability during business-critical banking operations with minimal downtime.