Summary
Overview
Work History
Education
Skills
SOFT SKILLS:
DECLARATION:
Timeline
Generic

SVP SARADHI B

Site Reliability Engineer II
Bengaluru

Summary

Proactive Site Reliability Engineer II with 4 years of experience driving automation and operational efficiency at Mobile Premier League. Expert in managing Kubernetes environments and cloud infrastructure on AWS and GCP. Skilled in scripting with Python and Bash to streamline workflows and enhance system reliability. Experienced in building and maintaining robust CI/CD pipelines, ensuring seamless software delivery. Strong collaborator dedicated to maintaining high availability and security of critical gaming services.

Overview

4
4
years of professional experience

Work History

Site Reliability Engineer II

Mobile Premier League (MPL)
05.2021 - Current

CI/CD & Deployment

๐Ÿš€ Development & Automation

  • Evaluated new technologies and tools to enhance overall system performance, stability, and security.
  • Developed custom scripts/tools using Python and Bash to automate routine tasks, increasing team productivity.
  • Collaborated with cross-functional teams (Dev, QA, Infra) to develop, test, and deploy scalable microservices.
  • Automated Kafka topic creation and ACLs using Jenkins, improving speed and reducing manual errors.
  • Managed microservices in Kubernetes (GKE), ensuring smooth rollout, high availability, and health checks.
  • Wrote and updated internal documentation and runbooks to help team members troubleshoot and respond faster.
  • Built and maintained CI/CD pipelines using Jenkins, Git, and ArgoCD for safe and automated deployments.
  • Used Harness to simplify application delivery across multiple environments and reduce deployment errors.
  • Used Docker to containerize applications for consistent development, staging, and production workflows.
  • Employed Terraform and Ansible for infrastructure provisioning and server configuration across AWS and GCP.
  • Integrated AWS CodeDeploy into the CI/CD process for EC2-based canary deployments, enabling gradual rollout and rollback strategies to minimize production risk.
  • Managed and monitored canary deployments on EC2 using CodeDeploy hooks and lifecycle events to ensure safe and controlled feature releases.

๐ŸŒ Infrastructure & Configuration Management

  • Managed Compute Engine instances, including setup, startup scripts, firewall rules, and monitoring.
  • Handled IAM roles and policies in both AWS and GCP to ensure secure and least-privilege access.
  • Tuned Linux systems (CPU, memory, disk I/O) for performance and stability in production environments.
  • Managed Cloudflare for DNS, SSL, and caching to boost website speed and protection against DDoS attacks.

๐Ÿ“Š Monitoring & Observability

  • Used New Relic, Grafana, and VictoriaMetrics for real-time monitoring, alerting, and dashboarding.
  • Set up Google Cloud Monitoring and Logging to track GCP infrastructure performance and logs.
  • Integrated Kibana with Elasticsearch for visual log analysis, helping debug issues faster.
  • Created log-based alerts with ElastAlert, sending real-time alerts via Slack/email for critical events.

โš ๏ธIncident Management & Reliability

  • Monitored 24/7 live gaming platforms like Rummy, Fantasy, Poker to ensure high uptime and performance.
  • Handled incident response using PagerDuty and Zenduty, ensuring quick resolution of production issues.
  • Improved on-call processes with clear escalation paths and alert tuning to reduce noise and fatigue.
  • Conducted Root Cause Analyses (RCA) after incidents to document issues, fixes, and long-term improvements.
  • Created troubleshooting documentation and issue resolution steps to improve team readiness and reliability.
  • Worked closely with QA and Dev teams during go-live events, monitoring performance and system stability post-deployment.

Education

Bachelor of Technology - Computer Science

Ananthalakshmi Institute of Technology & Sciences
Anantapur, India
04.2001 -

Skills

โ˜๏ธ Cloud Platforms:

AWS (EC2, S3, RDS, IAM, CodeDeploy), Google Cloud Platform (GKE, Compute Engine, Cloud Monitoring)

๐Ÿ’ป Programming & Scripting:

Python, Bash

๐Ÿ–ฅ๏ธ Operating Systems:

Linux (Ubuntu, RHEL)

โš™๏ธ Infrastructure as Code & Configuration:

Terraform, Ansible, Zookeeper

๐Ÿณ Containers & Orchestration:

Docker, Kubernetes (EKS/GKE)

๐Ÿ” CI/CD & DevOps Tools:

Jenkins, Harness, Git

๐Ÿ“Š Monitoring & Logging:

New Relic, Grafana, Kibana, Google Cloud Monitoring

๐Ÿง  Collaboration & Project Management:

Jira, Confluence

SOFT SKILLS:

  • Active Communicator, Confident and enthusiastic, Quick Learner and keen observer, Good at problem solving & decision making, Active team player, good at time management.
  • Languages โ€“ English, Telugu, Hindi
  • Interests โ€“ Cooking

DECLARATION:

I hereby declare that the above mentioned information is correct up to my knowledge and I bearthe responsibility for the correctness of above mentioned particulars.

Timeline

Site Reliability Engineer II

Mobile Premier League (MPL)
05.2021 - Current

Bachelor of Technology - Computer Science

Ananthalakshmi Institute of Technology & Sciences
04.2001 -
SVP SARADHI BSite Reliability Engineer II