Summary
Overview
Work History
Education
Skills
Accomplishments
Timeline
Generic

Ravindra Prasad

Sr DevOps Manager
Bangalore

Summary

DevOps/Platform Engineering Architect and Technical Evangelist with 19+ years delivering enterprise-scale platforms and digital transformations. Hands-on in Python (NumPy, Pandas, Matplotlib) for reliability/capacity analytics; expert in Kubernetes/Helm and GitOps at scale. Administered Argo CD for 700+ applications with app-of-apps, multi-tenant RBAC, and policy guardrails, achieving 99.95% SLOs, MTTR ↓40–60%, and deployment frequency ↑3–10x. Built and governed observability stacks (Prometheus, Grafana, ELK/EFK, OpenTelemetry) and platform governance (OPA/Gatekeeper, secrets/IAM). Designed and operated large-scale Kafka-based event platforms and cloud-native foundations with Terraform/IaC and CI/CD. Champion Agile+DevOps craftsmanship, mentoring teams and optimizing reliability, velocity, and cost (15–25% savings via FinOps). Targeting Senior/Principal DevOps, Platform Engineering Lead, or DevOps Architect roles.

Overview

19
19
years of professional experience
2
2
Languages

Work History

Sr DevOps Manager

Transunion
Bangalore , Karnataka
08.2021 - Current
  • Operating 50+ EKS clusters (AWS+GCP) across DEV/UAT/PROD with GitOps; 700 apps via Argo CD; sustained 99.95% SLOs and reduced MTTR by 40–60%
  • Security hardening via Bottlerocket migration on worker nodes and curated base images; reduced critical/high CVE exposure by 60–80% and shrank patch windows from weeks to hours
  • FinOps: $500/month immediate AWS bill reduction through rightsizing and image/compute optimizations; runway to scale savings across 50 clusters into multi-$K/month
  • Operated and optimized 50+ EKS clusters across AWS and GCP spanning DEV/UAT/PROD; standardized GitOps with Argo CD for 700 applications, sustaining 99.95% SLOs and improving MTTR by 40–60%.
  • Migrated Kubernetes worker nodes to Bottlerocket and curated base images; reduced critical/high CVE exposure by 60–80%, eliminated OS drift, and cut patch cycles from weeks to hours via immutable updates.
  • Delivered FinOps improvements with rightsizing, pod density tuning, and storage tiering; achieved $500/month immediate AWS savings and set autoscaling/policy baselines to expand into multi-$K monthly optimizations across 50 clusters.
  • Built Terraform-based EKS landing zones and reusable modules; reduced environment provisioning from weeks to 1–2 days and lowered change failure rate by ~30% through policy-as-code and automated rollbacks.
  • Implemented unified observability with OpenTelemetry, Prometheus, Grafana, and ELK; defined SLIs/SLOs, alert hygiene, and runbooks that reduced false alerts by ~35% and accelerated detection/recovery by 40–60%.
  • Established compliance-by-design: SAST/DAST/SCA, IAM/secrets hardening, OPA/Gatekeeper admission controls, SBOM scanning (Syft/Trivy); improved audit readiness and image governance for 700 apps across 50 clusters.
  • Led a 12–15 engineer DevOps/platform team; hiring, mentoring, OKRs, incident/postmortem governance, and cross-functional steering with product and security.
  • Built observability accelerators for 50+ EKS clusters and 700 apps: authored and productionized custom Prometheus exporters/integrations including Prom2Teams, Event Exporter, Certificate Exporter, CSR Exporter, and Elastic Exporter to standardize metrics and actionable alerting across DEV/UAT/PROD.
  • Implemented deep datastore monitoring for MongoDB and Redis (replication lag, cache hit ratio, memory/evictions, latency p95/p99, slow ops); delivered golden dashboards and SLO-aligned alerts that preempt capacity and performance incidents.
  • Engineered an enterprise alerting framework with 400+ Prometheus rules mapped to SLIs/SLOs and runbooks; reduced false alerts by ~30–40% and improved detection/recovery times by 40–60%, stabilizing on-call operations.
  • Integrated Alertmanager with Microsoft Teams via Prom2Teams, enabling severity-based routing, deduplication, inhibition, silences, and maintenance windows; improved triage quality and reduced alert fatigue.
  • Automated certificate hygiene using certificate/CSR exporters (expiry windows, issuance failures, trust chain validation); eliminated certificate-related outages and cut renewal toil through proactive alerting and ownership tagging.

Solution Architect (DevOps)

Accenture
Bangalore , Karnataka
02.2019 - 08.2021
  • Architected Kubernetes platforms and GitOps pipelines for regulated clients; improved deployment frequency 3–5x and achieved 99.9% availability with automated rollbacks and policy-as-code.
  • Standardized Terraform landing zones across AWS/Azure; cut environment provisioning to 1–2 days and improved change lead time by 60–70%.
  • Instituted secure SDLC and image governance (private registry, curated base images, SBOM scanning); reduced critical CVEs by ~50% in 90 days and accelerated SOC2/ISO/PCI audit readiness.
  • Implemented observability (Prometheus/Grafana, ELK) with SLIs/SLOs and incident playbooks; reduced false positives by ~30% and improved MTTR by ~40%.
  • Ran architecture workshops with client stakeholders; aligned platform roadmaps with business OKRs and accelerated modernization by 2 quarters.
  • Managed 10–15 engineers; established golden paths, quality gates, and runbooks that improved predictability and onboarding speed.

Sr. System Administrator

Wipro Limited
01.2017 - 01.2019
  • Partnered with multidisciplinary teams to implement tailored IT solutions aligning with organizational objectives.
  • Completed reports detailing network and systems performance and downtime issues.
  • Resolved issues and escalated problems with knowledgeable support and quality service.
  • Minimized downtime through proactive monitoring and prompt resolution of issues.
  • Improved end-user satisfaction by providing responsive technical support and addressing concerns efficiently.
  • Enhanced vendor partnerships to optimize procurement strategies.

Sr. System Administrator (

SK Vedainfo Pvt Ltd. Offrole Infosys
01.2016 - 12.2017
  • Collaborated with cross-functional teams to design and deploy customized IT solutions, meeting specific organizational needs.
  • Completed reports detailing network and systems performance and downtime issues.
  • Reduced downtime with proactive system monitoring and timely issue resolution.

Sr. Linux Administrator

Unique Infoways Pvt Ltd
09.2013 - 03.2015

System Administrator

Sai Infosystem (India) Ltd
01.2012 - 09.2013

Linux Administrator

Pearson Education Services
10.2010 - 06.2011

Associate Customer Engineer

HCL Infosystems Ltd
10.2006 - 10.2010

Education

B-Tech - Computer Science & Engineering

Rajasthan University

Diploma - CS Engg

Govt Polytechnic Kashipur
Roorkee

Skills

EKS Clusters

undefined

Accomplishments

  • Achieved my products deployed through Argo effectively helping .
  • Recently migrated 200+ all our applications and ELK pipelines to Kubernetes without any downtime to the clients.
  • Awarded by the management for all above achievements along with other products lifecycle managements

Timeline

Sr DevOps Manager

Transunion
08.2021 - Current

Solution Architect (DevOps)

Accenture
02.2019 - 08.2021

Sr. System Administrator

Wipro Limited
01.2017 - 01.2019

Sr. System Administrator (

SK Vedainfo Pvt Ltd. Offrole Infosys
01.2016 - 12.2017

Sr. Linux Administrator

Unique Infoways Pvt Ltd
09.2013 - 03.2015

System Administrator

Sai Infosystem (India) Ltd
01.2012 - 09.2013

Linux Administrator

Pearson Education Services
10.2010 - 06.2011

Associate Customer Engineer

HCL Infosystems Ltd
10.2006 - 10.2010

B-Tech - Computer Science & Engineering

Rajasthan University

Diploma - CS Engg

Govt Polytechnic Kashipur
Ravindra PrasadSr DevOps Manager