Hi, I’m

Shree Vallabh

Engineering Manager - Devops
Bangalore

Summary

Versatile Principal DevOps Engineer with a proven track record of delivering exceptional results. Demonstrates a strong work ethic and unwavering commitment to quality, consistently meeting job demands and deadlines. Brings over 11 years of comprehensive professional experience in Linux environments, cloud operations (AWS and GCP), networking, security, DevOps, SRE practices, and leadership.

Overview

11
years of professional experience
6
Certifications
3
Languages

Work History

Apna

Engineering Manager - Devops
10.2024 - Current

Job overview

  • Standardised cloud infrastructure by implementing consistent GCP and GKE resource management, including node pool segregation to align resource usage with teams and business units.
  • Mentored junior architects and senior engineers, building cross-functional competency in security, automation, and cloud-native technologies.
  • Spearheaded modernisation programmes to migrate legacy infrastructure to cloud-native stacks, with automated scaling and resilient design using JIRA workflow, Cookiecutter, Rundeck, and Jenkins.
  • Migrated the incident management system from Opsgenie to Zenduty, built from scratch.
  • Led Proof of Concept (PoC) initiatives for evaluating emerging DevSecOps tools, observability platforms, and improving reliability to reduce mean time to detect and respond (MTTD/MTTR).
  • Designed and deployed a production-grade observability stack using VictoriaMetrics, integrating Alertmanager and Blackbox Exporter to ensure proactive incident detection and system reliability. Enabled granular monitoring by exposing Kong/Nginx ingress metrics aligned with SLOs, SLAs, and other performance indicators.
  • Delivered monthly cost savings of over £24,000 by downscaling compute resources, optimising GKE node pools, cleaning up unused GCP projects, autoscaling Elasticsearch, and optimising Datadog.
  • Migrated all Helm charts to ArgoCD and brought outdated charts back in sync with current configurations, enabling better monitoring and upgrade workflows.
  • Led SSO implementation across all internal tools, migrated services behind VPN, enforced secure access, and restructured IAM policy.
  • Consolidated alerting systems by removing unused Datadog alerts, streamlining GCP alert policies, and centralising tracking for all infrastructure alerts.
  • Championed initiatives focused on security, observability, and cost optimisation by proactively identifying system gaps, driving automation, and aligning engineering goals with business priorities.
  • Led a team of 5-6 engineers, owning sprint planning, daily stand-ups, retrospectives, and the execution of high-impact infrastructure projects.
  • Built a GCP cost dashboard using AI tools (Cursor AI) that clarified cloud infrastructure spend and supported further optimisation.
  • Created multiple automation workflows using AI tools (MCP and n8n).

Meesho

Principal Devops Engineer (Solution Architect)
10.2021 - 05.2024

Job overview

  • Implemented organisation-wide Single Sign-On (SSO) integrations for multiple DevOps tools, enhancing security and streamlining authentication.
  • Delivered and maintained highly modular infrastructure using Terraform and Terragrunt to standardise provisioning across GCP and AWS environments.
  • Developed a comprehensive monitoring platform with VictoriaMetrics and established logging and tracing with Coralogix for production systems on EKS, handling 13 trillion metric data points.
  • Familiar with AWS, GCP, and Azure services, and with multi-cloud architecture patterns.
  • Owned and defined the end-to-end architecture for cloud-native infrastructure, including Kubernetes multi-cluster environments (GKE/EKS), with a strong focus on scalability, security, and cost optimisation.
  • Led solution architecture initiatives for organisation-wide DevOps and security transformations, aligning closely with business and technical stakeholders.
  • Designed and deployed Kubernetes architecture, including a multi-cluster setup with Cilium cluster mesh, end-to-end automation using Terraform and Helm, and CI/CD pipelines with ArgoCD and Jenkins.
  • Planned and executed major EKS cluster in-place upgrades without downtime, utilising Terraform and Ansible, and implementing disaster recovery (DR) for critical services.
  • Led the migration of over 300 applications from EC2 to Kubernetes (EKS) within an 80-day timeframe, achieving zero application downtime.
  • Architected and deployed GitOps-based CI/CD workflows using ArgoCD and Jenkins, fully integrated with security scanners, policy enforcers, and other tools.
  • Managed capacity planning during peak sales and optimised systems at scale to enhance reliability and uptime in Kubernetes environments.
  • Contributed to the design of SRE dashboards (tracking incidents by severity, managing root cause analyses (RCAs), and handling uptime and alert configurations) and DevOps dashboards (scaling apps, CI/CD triggers, and configuration management).
  • Implemented centralised alerting and incident response flows using Alertmanager and Coralogix for automated escalations and RCA tracking.
  • Deployed ELK and EFK stacks for full-stack visibility and enhanced operational telemetry, used for SLO/SLA monitoring and dashboarding.
  • Ensured system stability during major sales events at Meesho, adhering to organisational standards and procedures.
  • Demonstrated extensive knowledge in distributed systems, DevSecOps, and cloud security.
  • Defined organisation-wide DevSecOps and cloud security guidelines, including baseline IAM policies, CI/CD hardening, logging standards, and incident response protocols in coordination with InfoSec.
  • Enforced shift-left security practices by embedding automated compliance checks, code quality gates using SAST (SonarQube), and early testing in the development lifecycle.
  • Integrated security scanning tools such as Trivy, OPA, and OWASP-ZAP (DAST) into CI/CD pipelines to ensure vulnerability-free releases.
  • Provided consistent mentorship and leadership to a team of 4–5 members over the past three years.

Recko

Devops Engineer III (Devops Lead)
07.2020 - 10.2021

Job overview

  • Complete infrastructure management using Terraform in the AWS cloud.
  • Migrating and managing all microservices-based applications on an AWS EKS cluster.
  • Implementation of tools such as Kafka, a logging stack (EFK), and a monitoring stack (Thanos) in the EKS cluster using Helm charts.
  • Security using Cloudflare.
  • Disaster management plan with Terraform and Bash scripts.
  • CI/CD using Jenkins and Argo CD.
  • Load balancing of services using the Nginx ingress controller and the ALB ingress controller.
  • Implementing and managing Java and Node.js applications on Kubernetes (EKS).
  • Configuration management using Helm charts on Kubernetes clusters for major services.
  • Monitoring and logging using AWS CloudWatch, Thanos, Grafana, Elasticsearch, Fluentd, and Kibana.
  • Implementation and integration of application tracing using Jaeger.
  • Automation using Bash scripting and Python.

Traveloka

Devops Engineer T3
04.2019 - 05.2020

Job overview

  • Infrastructure management using Terraform in the AWS cloud.
  • Migration from a shared account to a multi-account setup in the AWS cloud.
  • Implementing and managing Java, Node.js, and Go applications in Linux environments and the AWS cloud.
  • Configuration management and deployment using Ansible, Jenkins, AWS CodeBuild, CodeDeploy, and CodePipeline.
  • Migrated Java services from an EC2 architecture to AWS ECS using Terraform modules.
  • Developed a single-page application dashboard using Python Flask for monitoring idle/stopped instances, generating dynamic PDFs, and sending email notifications to leads.
  • Regular monitoring and decommissioning of unused resources in the AWS cloud for cost savings.
  • Led the cost optimisation dashboard project, which gives visibility into costs across a total of 180 accounts (multi-account setup) and recommendations for cost optimisation.
  • 24/7 support for production servers and troubleshooting of issues.

Swoo, Abu Dhabi Financial Group

Devops Engineer
09.2018 - 04.2019

Job overview

  • Created and executed automation processes, enhancing application scalability and functionality.
  • Implementing and managing technologies such as Nginx and Tomcat servers in Linux environments.
  • Managing Java, Go, and Node.js applications on the AWS cloud.
  • Implementing the above applications on internal development, staging, and production servers.
  • Continuous integration and deployment using Jenkins, Ansible, Groovy, CircleCI and
  • Monitoring using AWS CloudWatch, Datadog, Prometheus, and Grafana.
  • IaaS using Terraform.
  • 24/7 support for production servers and troubleshooting of issues.

Mobinius Technologies Pvt Ltd

Senior System Administrator
08.2016 - 07.2018

Job overview

  • Managing IT infrastructure (VMs) and Virtual Private Networks in the Amazon Web Services cloud.
  • Implementing and managing technologies such as Apache, HAProxy, and Nginx.
  • Monitoring through Nagios, Prometheus, and Grafana, providing visualisation and alerts to confirm that servers and services are online, customer data is flowing, connectivity is available, and data is being stored properly.
  • Linux production server administration: VM setup and creation, backups, file servers, LVM configuration, security patch updates, etc.
  • Release management: providing release deployment support for all environments and working closely with development and testing teams on new features, application patches, and releases.
  • Build, continuous integration, deployment, and release through Jenkins.
  • Delivered reliable support for all server-class systems.
  • Securing servers with certificates (DigiCert, Let's Encrypt, AWS Certificate Manager) for the endpoint technologies being deployed.
  • Set up, optimised, and managed network equipment.
  • Assessed the latest innovations and adopted cost-effective, useful solutions.
  • Tracking issues and tickets and providing the best resolution.

Micro E Project

Software Engineer
08.2014 - 08.2016

Job overview

  • Building web applications using PHP and MySQL.
  • Designing power and control circuits, and interfacing Arduino, Raspberry Pi, GSM, Wi-Fi, and relays for IoT products.
  • Involved in project phases including technical design, development, testing, debugging, implementation, documentation, and incorporation of user feedback to plan enhancements to the application system.
  • Worked on IoT product development, hosting, and deployment, and was involved in on-site implementation and commissioning of products.
  • Automation and integration of the application with external systems.
  • Directing overall system administration activities at different client locations.

Education

VTU

BE in Computer Science Engineering

Board of Karnataka

PUC in Science

Board of Secondary Education

SSLC from School

Skills

Linux administration and networking

Certification

RHCSA and RHCE (2017)

Achievements

  • Led the DevOps team at Meesho in migrating 300+ microservices to Kubernetes (EKS) without any downtime, reducing costs by 35% (2023). Also led the DevOps team at Meesho in migrating 600+ microservices to GKE without any downtime.
  • Implemented a monitoring stack at Meesho, scaling up to 13 trillion metric data points in a single cluster, following SRE best practices.
  • Implemented Kubernetes on EKS from scratch at Recko and migrated 40 production applications without any downtime, reducing costs by 30% (2020).
  • Drove cost optimisation at Swoo and Traveloka, delivering significant savings (2018-19).
  • Provided cost-effective proposals for moving away from VM-based services and moved infrastructure to the AWS cloud at Mobinius (2017).

Hobbies

  • Playing and watching cricket, volleyball, and badminton
  • Reading books and binge-watching

Languages

  • English
  • Kannada
  • Telugu
  • Hindi

Accomplishments

  • Led the DevOps team at Meesho in migrating 300+ microservices to Kubernetes (EKS) without any downtime, reducing costs by 35% (2023), and also led the DevOps team at Meesho in migrating 600+ microservices to GKE without any downtime.
  • Implemented a monitoring stack at Meesho, scaling up to 13 trillion metric data points in a single cluster, following SRE best practices.
  • Implemented Kubernetes on EKS from scratch at Recko and migrated 40 production applications without any downtime, reducing costs by 30% (2020).
  • Drove cost optimisation at Swoo and Traveloka, saving costs (2018-19).
  • Provided cost-effective proposals for moving away from VM-based services and moved infrastructure to the AWS cloud at Mobinius (2017).

Timeline

Engineering Manager - Devops

Apna
10.2024 - Current

Principal Devops Engineer (Solution Architect)

Meesho
10.2021 - 05.2024

Devops Engineer III (Devops Lead)

Recko
07.2020 - 10.2021

Devops Engineer T3

Traveloka
04.2019 - 05.2020

Devops Engineer

Swoo, Abu Dhabi Financial Group
09.2018 - 04.2019

Senior System Administrator

Mobinius Technologies Pvt Ltd
08.2016 - 07.2018

Software Engineer

Micro E Project
08.2014 - 08.2016

Board of Karnataka

PUC in Science
2009

Board of Secondary Education

SSLC from School
2007

VTU

BE in Computer Science Engineering
2013