Rohan Agarwal

Site Reliability Engineer
Bengaluru

Summary

I am a Site Reliability Engineer with expertise in Kubernetes, Terraform, Helm, and Argo CD for cloud-native infrastructure automation. I have managed complex deployments across GCP and Azure using GitOps practices, implemented CI/CD with GitHub Actions and Argo CD, and maintained critical infrastructure components including MongoDB, Redis, Kafka, and RabbitMQ. I specialize in container orchestration, infrastructure as code, and monitoring with the Prometheus and Grafana stack.

Overview

1 year of professional experience
4 years of post-secondary education

Work History

SRE

Nference
Bengaluru
07.2024 - Current
  • Led the migration of critical data stores and message brokers (MongoDB, Redis, Kafka, RabbitMQ, MySQL, etcd) to Kubernetes using open-source operators, implementing robust backup solutions with automated cron jobs that store data in cloud storage buckets.
  • Streamlined Helm chart deployments by integrating with Terraform, minimizing deployment risks and ensuring consistent version control across environments.
  • Implemented fine-grained RBAC in Argo CD, enabling secure and controlled access for different teams while maintaining system security.
  • Developed and maintained standardized Helm charts for application deployments across multiple environments, customizing configurations to ensure seamless integration with existing infrastructure.
  • Architected and automated deployment processes through Argo CD and GitOps practices, achieving a 40% reduction in deployment time and a 20% increase in release frequency.
  • Established infrastructure for the AI Clinical Data platform across multi-cloud environments using Terraform, ensuring scalable and reliable deployments.
  • Successfully orchestrated the migration of servers from Ubuntu 18 to Ubuntu 22, including critical applications and tools such as Redis, RabbitMQ, etcd clusters, Elasticsearch, and Spark.
  • Implemented comprehensive monitoring solutions using Prometheus, Grafana, Loki, and Promtail in Kubernetes environments, enhancing system observability and incident response capabilities.

SRE-Intern

Nference
Bengaluru
01.2024 - 06.2024
  • Architected and implemented a comprehensive App of Apps structure using Helm charts and Argo CD, standardizing deployment processes and significantly improving deployment reliability across multiple environments.
  • Developed reusable and modular Terraform modules for infrastructure provisioning, enabling consistent resource deployment across different environments and reducing configuration errors by standardizing infrastructure code.
  • Successfully led complex application migrations to Kubernetes, implementing secure secret management through External Secrets Operator while ensuring zero downtime and maintaining system stability.
  • Established and optimized CI/CD pipelines using GitHub Actions, automating source code builds, artifact publishing to Nexus Repository, and container deployments to Kubernetes clusters.
  • Designed and implemented multi-cloud infrastructure solutions across GCP and Azure platforms, including GKE and AKS clusters, managing complex networking configurations with Nginx Ingress, Load Balancers, and security controls.
  • Engineered and managed infrastructure in air-gapped environments, implementing robust security measures and compliance controls while maintaining system isolation requirements.
  • Orchestrated Kubernetes cluster operations, including setup, configuration, and maintenance, while implementing resource optimization strategies that improved cluster efficiency and reduced operational costs.
  • Built and maintained comprehensive networking infrastructure including API Gateways, Firewall rules, NAT configurations, and Bastion Hosts, ensuring secure and efficient communication between services.

Education

Bachelor of Technology - Information Technology

Indian Institute of Information Technology, Lucknow
Lucknow, India
09.2020 - 06.2024

Skills

Infrastructure automation

Incident management

System monitoring

Infrastructure design

Problem-solving abilities

Argo CD, GitOps

Kubernetes

Terraform

Puppet, Ansible

GCP, Azure

Timeline

SRE

Nference
07.2024 - Current

SRE-Intern

Nference
01.2024 - 06.2024

Bachelor of Technology - Information Technology

Indian Institute of Information Technology, Lucknow
09.2020 - 06.2024