
Yaswanth Vegesna

Summary

An SRE and DevOps professional with a proven ability to administer and control the operations, deployment, and maintenance of cloud-based and on-prem information systems across different domains. A good communicator who relates well with people at all levels of the SDLC and works flexibly both as part of a team and as a sole contributor.

Overview

11 years of professional experience
2011 years of post-secondary education
1 Certification

Work history

Site Reliability Engineer

Wipro
Bengaluru, Karnataka
09.2024 - Current

Project: Optum SRE-Consumer-Digital

  • Orchestrated full-stack observability with Datadog, integrating APM, infra monitoring, logs, RUM, synthetic monitoring, and custom metrics for holistic visibility.
  • Engineered SLI/SLO frameworks with burn-rate based alerts and implemented Monitoring as Code using Terraform.
  • Established Kubernetes observability with readiness/liveness probes, scaling metrics, and cluster health validation.
  • Implemented database observability for Oracle and cache observability for Redis, covering cache performance, latency, and replication metrics.
  • Automated Datadog alerting and SLO monitoring using Terraform scripts, with GitHub Copilot accelerating the creation of burn-rate alerts, Kubernetes observability, and service-level dashboards, which enhanced reliability and reduced manual configuration overhead.
  • Deployed RUM and Synthetic Monitoring to validate real user experience and simulate critical business transactions.
  • Integrated Azure Application Gateway logs into Datadog for centralized traffic monitoring and anomaly detection.
  • Automated incident triage, routing, and escalation using Datadog–PagerDuty–ServiceNow workflows, reducing MTTR.
  • Utilized Splunk for application-level monitoring and log analysis across multiple services, enabling proactive issue detection, performance optimization, and real-time alerting.
  • Conducted postmortems (RCAs) and drove reliability roadmaps aligned with error budgets.
  • Implemented progressive delivery (canary & blue/green deployments) tied to error budgets for safe rollouts.
  • Developed security observability (Application Gateway, firewall events, API auth failures) in Datadog.
  • Leveraged Datadog Watchdog (AI anomaly detection) for proactive service health monitoring.
  • Mentored teams on observability-driven development, SRE practices, and incident automation.

SRE & DevOps Engineer

Allstate India Pvt Ltd
Bangalore
07.2017 - 08.2024
  • Provided enterprise-wide support for all CI/CD tools, including CloudBees Jenkins, GitHub, Artifactory, Octopus Deploy, and SonarQube
  • Developed and maintained Terraform templates for AWS resources
  • Managed Kubernetes clusters, ensuring high availability and scalability; implemented rolling updates and auto-scaling
  • Streamlined the CI/CD pipeline for Java applications using Maven and Gradle, reducing build and deployment times
  • Helped enhance monitoring of the production environment to ensure speedy resolution of issues with e-commerce search optimization and recommendation APIs
  • Ensured full-feed and delta-feed processing through Datadog event management
  • Led a team of six people across different time zones to provide continuity and on-call support
  • Performed backup and snapshot creation and restoration from AWS S3 for DR server setup
  • Handled configuration and patch management using Ansible playbooks while managing infrastructure SLIs, SLOs, and SLAs to meet site reliability criteria
  • Deployed microservices using Docker Compose in Podman containers and to EKS
  • Implemented Role-Based Access Control (RBAC) policies and user management to secure OpenShift environments

Build and Release Engineer

IBM
01.2015 - 06.2017
  • Generated iFixes and fix packs for each release
  • Updated source copyrights
  • Set up build environments for specific versions and managed user password changes
  • Cloned and set up development build servers
  • Configured Appmake and Mainwin
  • Set up UNIX and Windows build machines
  • Set up Jenkins for continuous integration builds
  • Maintained Appmake and created project-specific profiles
  • Deployed EAR/WAR applications across various environments
  • Created and set up workspaces for releases

Education

B.Tech

Gayatri Vidya Parishad College of Engineering

Skills

  • Cloud computing expertise: AWS
  • Monitoring and Logging: Prometheus with Grafana, Datadog
  • Containerization services: Docker, EKS
  • Scripting languages: Python, Shell
  • Incident management tools: ServiceNow
  • API Development tool: Postman
  • Infrastructure Automation: Terraform
  • Linux environments: RHEL, CentOS
  • Source and Version control tools: Git
  • Configuration management: Ansible
  • CI/CD tools: Jenkins, GitHub Actions

Certification

  • Certified Kubernetes Administrator

Timeline

Site Reliability Engineer

Wipro
09.2024 - Current

SRE & DevOps Engineer

Allstate India Pvt Ltd
07.2017 - 08.2024

Build and Release Engineer

IBM
01.2015 - 06.2017

B.Tech

Gayatri Vidya Parishad College of Engineering