Over 11 years of experience in DevOps and Build and Release Engineering, specializing in automation and infrastructure management across on-premises and AWS cloud environments. Demonstrated success in architecting scalable, fault-tolerant solutions using infrastructure as code (IaC) and CI/CD pipelines for mission-critical applications. Expertise in networking concepts, ensuring secure and robust infrastructure designs. Proven ability to automate resource management and implement monitoring tools, optimizing costs while maintaining application uptime and CI/CD efficiency.
Overview
11 years of professional experience
Work History
Senior Operations Engineer
Khoros LLC
Bengaluru
04.2024 - Current
Maintain and manage Amazon EKS and vanilla Kubernetes clusters, ensuring high availability and optimal performance across environments hosting 100 or more microservices.
Migrated Kubernetes workloads from Flux to ArgoCD, improving release management through features such as automated rollback and enhanced deployment visibility.
Automate infrastructure provisioning using Terraform and manage configuration with Ansible (including Ansible Tower and Red Hat AAP) for streamlined cluster bootstrapping and environment management.
Own and manage all AWS cloud resources tied to customer-facing applications; handle provisioning, updates, and decommissioning of infrastructure using CloudFormation templates.
Led initiatives to incorporate DevSecOps practices, embedding security scanning and vulnerability assessment tools, such as Twistlock and Checkmarx, into automated pipelines to detect code and infrastructure vulnerabilities early in the development lifecycle.
Develop cost optimization strategies, such as identifying and deleting unused EBS volumes, unattached Elastic IPs, and idle snapshots, reducing unnecessary storage costs and aligning with AWS best practices.
Implement RDS scaling policies based on usage metrics.
Leveraged Karpenter on EKS to enable on-demand autoscaling of EC2 nodes, optimizing workload distribution by dynamically selecting the most suitable instance types based on resource requirements and availability, leading to significant compute cost savings and improved efficiency compared to managed node groups or static node pools.
Implement monitoring and observability practices using Datadog and Prometheus, setting up dashboards, alerts, and metric-based health checks to proactively detect and resolve issues.
Manage infrastructure maintenance activities, including OS patching, Kubernetes upgrades, and cloud service improvements, by coordinating with Engineering, SRE, and QA teams to minimize impact and avoid downtime.
Participate in on-call rotations, ensuring 24/7 support for production systems.
Enhance cloud and Kubernetes security by refining IAM roles, RBAC policies, and access controls to align with best practices and compliance standards.
Principal Software Engineer
Dell Technologies Ltd.
Bengaluru
01.2023 - 03.2024
Designed and implemented CI/CD pipelines for Kubernetes-based microservices using Groovy-based shared libraries in Jenkins and ArgoCD for GitOps-based deployments, reducing deployment time and increasing reliability.
Enforced Kubernetes network policies and implemented horizontal pod autoscaling using metrics-server, ensuring workload isolation and efficient resource utilization.
Worked on initiatives that optimized cloud resource utilization and cost management, resulting in monthly savings of approximately $20,000 for the organization.
Built event-driven microservices using Confluent Kafka for real-time data processing in distributed systems.
Handled incident management and post-incident reviews.
Installed and maintained a centralized logging cluster using Fluentd, Logstash, Elasticsearch, and Kibana (ELK stack) to aggregate and parse logs from 10+ microservices and associated Docker containers, improving visibility and incident response time.
Automated the rotation of secrets and credentials using AWS Secrets Manager, enhancing security posture by ensuring timely updates. Implemented auto-tagging Lambda functions for cost attribution, ensuring all resources are tagged by team, environment, and project for clear cost visibility.
Implemented IAM policy audits and enforced least privilege policies by leveraging Lambda checks and AWS Config rules.
Refactored Terraform code into reusable modules to enhance scalability and consistency across environments, and implemented pre-commit hooks to enforce formatting, validation, and best practices.
Software Engineer IV - DevOps
Cisco
Bengaluru
10.2020 - 12.2022
Developed and maintained Ansible playbooks to automate the provisioning of Kubernetes clusters and supporting tools (e.g., ingress controllers, monitoring agents, logging stacks).
Implemented end-to-end CI/CD pipelines using Jenkins, Git, and Ansible to automate code integration, Docker image builds, testing, and deployment across environments.
Introduced GitOps workflows using ArgoCD to manage Kubernetes cluster configurations and application delivery, ensuring consistency, traceability, and automated rollbacks.
Built Python-based automation to extend the legacy deployment process for compatibility with a hybrid (on-premises + cloud) architecture.
Migrated a major retail client’s monolithic applications from on-premises infrastructure to AWS, performing a lift-and-shift deployment onto Amazon EC2 instances while preserving the existing architecture.
Enabled backups, EBS snapshot automation, and lifecycle policies to support disaster recovery and long-term retention.
Implemented security hardening through EC2 instance roles, SSH key rotation, Security Groups, and NACLs, aligned with enterprise compliance standards.
Collaborated with engineering teams to drive and support the migration from legacy application platforms to Kubernetes, providing technical guidance, best practices, and hands-on enablement.
Developed a Lambda function that monitors CloudWatch Log Groups, parses application logs in real time, and pushes them to Sumo Logic for centralized monitoring, improving operational insight and speeding up troubleshooting.
Senior DevOps Engineer
Sabre Corporation
Bengaluru
09.2018 - 10.2020
Deployed multi-account AWS environments, implementing centralized logging and monitoring to enhance operational visibility.
Implemented security measures, including IAM roles and policies, to safeguard cloud resources and ensure compliance with client standards.
Analyzed existing AWS infrastructure and implemented optimizations, including instance right-sizing and EBS lifecycle policies.
Set up comprehensive monitoring of production workloads using CloudWatch, Lambda, AppDynamics, and Kibana, enabling real-time alerting, performance visibility, and log aggregation.
Improved CI/CD workflows by integrating Jenkins, Git, and Ansible, introducing automation for build, test, artifact creation, and zero-downtime deployments across environments.
Introduced gated stages with manual approvals (e.g., via Jira ticket) for production releases, aligned with audit and compliance needs.
Migrated production workloads for travel sector clients from on-premises infrastructure to AWS, ensuring secure data transfer, minimal downtime, and infrastructure readiness for scaling.
Implemented automated rollback strategies, blue/green deployments, and infrastructure validation steps within the CI/CD pipeline to ensure release reliability.
Lead DevOps Engineer
Envestnet Yodlee
Bengaluru
03.2018 - 09.2018
Supported engineering plans and schedules by providing release schedules and build and deployment services, developing scripts, overseeing branching and merging strategies, and building automation tools as needed.
Optimized pipeline execution time by parallelizing jobs and introducing incremental builds.
Implemented environment-specific configuration management and secrets handling within CI/CD workflows to ensure secure and consistent deployments.
Technical Lead
Cognizant Technology Solutions
Bengaluru
03.2015 - 03.2018
Configured and managed secure cloud infrastructure by creating IAM roles and policies to enforce least-privilege access, and setting up Security Groups and NACLs within public/private subnets in VPCs. This included restricting SSH access to bastion hosts and isolating backend services in private subnets for an enhanced security posture.
Administered development and collaboration platforms such as JIRA and Confluence by managing user permissions, configuring workflows, and automating project reporting. Oversaw Nexus and Artifactory upgrades and the migration from SVN to Git, and integrated code repositories with CI tools for full traceability and audit compliance.
Developed and monitored custom dashboards for performance and reliability by integrating Prometheus with Grafana to track application metrics such as CPU, memory, and latency. Used Splunk for log aggregation and alerting, enabling root cause analysis and proactive resolution of production incidents.
Implemented Infrastructure as Code and configuration management by leveraging CloudFormation templates for provisioning EC2 instances, RDS, and S3 buckets. Automated configuration tasks with Ansible playbooks to maintain consistency across environments and reduce deployment time from hours to minutes.
Automated build, test, and deployment pipelines using Jenkins integrated with GitHub webhooks, triggering builds on code commits and deploying artifacts to QA/staging environments.
Build and Release Engineer
Informatica
05.2014 - 03.2015
Created build scripts (Maven, Ant, and Bash) and automated deployment procedures as part of the build and deployment workflow.
Handled configuration management tasks, including the integration and configuration of tools such as Apache HTTPD, Tomcat, Nagios, and Nexus.
Migrated legacy repositories to Perforce, ensuring minimal downtime and full auditability.