Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic
Sneha Kumari

Sneha Kumari

Site Reliability Engineer
Pune

Summary

Results-driven Site Reliability Engineer with 3.5 years of experience in enhancing system reliability and automating processes in cloud and on-prem environments. Expertise in managing incidents, cloud migrations, and infrastructure monitoring. Committed to improving system stability and observability through collaboration with development teams.

Overview

6
6
Certifications
4
4
years of professional experience

Work History

Site Reliability Engineer

SLB
09.2022 - Current
  • Built a Dynatrace SRE Assist tool using Microsoft Copilot Studio (Agentic AI) with custom connectors, enabling teams with limited monitoring knowledge to easily check application health, active problems, availability, logs, traces, and usage insights.
  • Led the Windows OS upgrade (2012 to 2019) for over 200 Azure servers using Terraform, with smooth cutovers.
  • Worked on migrating on-premises servers to Azure with minimal downtime.
  • Managed Dynatrace monitoring setup, including synthetic, RUM, SSL, SLI/SLO, and dashboard creation, while overseeing Azure infrastructure provisioning, scaling, and maintenance.
  • Regularly participated in Disaster Recovery (DR) drills for all applications, ensuring readiness for unforeseen outages, and validating recovery procedures.
  • Built and maintained CI/CD pipelines. Automated tasks using Python and Bash, reducing manual effort.
  • Used Splunk and FireFlow for network troubleshooting.
  • Managed incidents end-to-end through ServiceNow and Remedy, and conducted root-cause analyses after major incidents to identify areas for process improvement, or technical enhancement opportunities.
  • Optimized resource utilization across cloud-based infrastructure environments, leading to cost savings.
  • Mentored junior engineers on best practices for system reliability and incident response strategies.
  • Implemented automated monitoring solutions to enhance system performance and reliability.
  • Analyzed system metrics to enhance the availability and scalability of services.
  • Collaborated with cross-functional teams to troubleshoot and resolve production incidents promptly.
  • Developed and maintained infrastructure as code using Terraform for efficient resource management.
  • Achieved significant cost reductions by optimizing resource allocation and eliminating wasteful practices.
  • Contributed to an accelerated patching effort to install monthly patches within defined maintenance windows. Non-prod systems were patched after Microsoft’s monthly patch release (the second Tuesday), followed by validation before proceeding with production patching.
  • Worked toward ensuring DLO compliance across multiple applications by continuously tracking vulnerabilities, coordinating fixes, and performing regular remediation to meet security and audit standards.

Trainee Software Engineer

Dealermatrix
01.2022 - 08.2022
  • Developed and implemented software features using Salesforce technology to enhance application functionality.
  • Collaborated with internal teams to gather requirements, and ensure alignment with project objectives.
  • Conducted code reviews, ensuring adherence to best practices and improving code quality across projects.
  • Assisted in debugging and troubleshooting issues, contributing to a seamless user experience for applications.

Education

B.E - Computer Science

Chandigarh University
Mohali, India
04.2001 -

Higher Secondary School - PCM

Jawahar Navodaya Vidyalaya
Patna, India
04.2001 -

Skills

Incident management

Infrastructure automation

Containerization technologies

System monitoring

Log analysis

Network troubleshooting

Web server administration

Security best practices

Disaster recovery

Microservices architecture

SSL Certificate management

Load Balancer

Certification

Microsoft AZ 900

Timeline

Site Reliability Engineer

SLB
09.2022 - Current

Trainee Software Engineer

Dealermatrix
01.2022 - 08.2022

B.E - Computer Science

Chandigarh University
04.2001 -

Higher Secondary School - PCM

Jawahar Navodaya Vidyalaya
04.2001 -
Sneha KumariSite Reliability Engineer