
Sreeya Kovvuru

Hyderabad

Summary

Observability Engineer with a background in implementing and optimizing monitoring and observability solutions using AppDynamics and the OpenTelemetry framework. Experienced in deploying agents for applications, machines, and databases to ensure high performance and reliability across distributed environments.

Overview

3 years of professional experience

Work History

Observability Engineer

Tata Consultancy Services
Hyderabad
07.2022 - Current
  • Designed and implemented a comprehensive observability stack using OpenTelemetry for telemetry collection, Prometheus for metrics scraping and alerting, and Grafana for real-time dashboard visualization.
  • Configured and maintained OTEL Collectors, Tempo, and Loki to enable full-stack logs, traces, and metrics observability.
  • Developed and deployed custom Prometheus exporters and implemented application-level custom metrics to enhance monitoring coverage.
  • Created and managed detailed Grafana dashboards for KPIs, infrastructure health, trace analytics, and service performance.
  • Defined and maintained alerting rules in Prometheus; integrated alerting pipelines with PagerDuty and ServiceNow for incident creation and response.
  • Automated deployment of observability infrastructure components using Terraform, ensuring repeatability and scalability.
  • Performed basic operations and configuration management of Dynatrace to support monitoring workflows and incident escalation processes.
  • Deployed and managed AppDynamics agents to provide deep visibility into application, infrastructure, and database performance, enabling full-stack monitoring.

POC – Automated Incident Management & Remediation:

  • Implemented a proof of concept using AppDynamics to monitor application errors, batch job failures, and infrastructure errors with threshold-based alerting.
  • Integrated AppDynamics alerts to automatically trigger incident creation in ServiceNow, improving incident management efficiency and traceability.
  • Configured n8n automation workflows to run remediation scripts upon incident creation, reducing manual intervention and Mean Time to Resolution (MTTR).
  • Collaborated cross-functionally with development, infrastructure, and operations teams to improve observability standards, troubleshoot incidents, and enhance system reliability.

Education

Bachelor of Technology - Electronics And Instrumentation

CVR COLLEGE OF ENGINEERING
Hyderabad
05.2022

Skills

  • Observability and APM: Prometheus, Grafana, AppDynamics, OpenTelemetry, Loki, Tempo
  • Alerting and incident management: Prometheus Alertmanager, Grafana Alerts, ServiceNow, PagerDuty
  • Cloud platforms: AWS
  • Infrastructure and IaC: Terraform, Ansible (basic), Bash, Python (basic)
  • Metrics, logs, and traces: custom exporters, RUM, spans, and synthetic monitoring
  • Automation and scripting: n8n (workflow automation), Bash, Python (basic)
  • Soft skills: communication, collaboration, problem-solving, documentation
