Laxmi Patil

Bengaluru

Summary

Experienced SRE with more than 7 years of expertise in Azure cloud services. Skilled in collaborating with cross-functional teams to ensure smooth service delivery, improve system performance, and resolve issues promptly. Passionate about continuous service improvement and enhancing customer experience through proactive monitoring and innovative solutions.

Overview

8 years of professional experience
7 years of post-secondary education

Work History

Service Engineer 2

Microsoft Pvt Ltd
Bengaluru
03.2020 - Current
  • Led automation initiatives to mitigate incidents in Azure Storage, reducing manual investigation time and improving response efficiency
  • Built self-healing automation to proactively detect and resolve hardware failures with zero customer impact
  • Utilized Geneva and Jarvis dashboards to analyse and monitor the performance of entire Azure Storage tenants, focusing on metrics like throughput, IOPS, and latency
  • Ensured the maintenance of SLOs/SLIs by continuously monitoring these metrics and detecting any deviations from expected performance levels
  • Configured alert triggers for anomalies, notifying the relevant tenant owners and enabling swift corrective actions to prevent service degradation
  • Contributed to KPI tracking by helping to reduce MTTR and enhance overall system availability based on real-time performance data
  • Spearheaded SKU validation for Azure Storage tiers (Standard, XIO, Premium V2) by overseeing end-to-end validation of pilotfish tenants before customer traffic release
  • Managed qualification of hardware components (HDD, SSD, CPU, HBA), ensuring compliance with performance and reliability standards
  • Qualified more than 1500 HDD/SSD drives for various vendors
  • Automated hardware qualification workflows using Azure Logic Apps, Data Factory, Databricks (Python), and PowerShell, reducing manual effort by 800+ hours in 8 months
  • Drove incident mitigation strategies using Geneva automation workflows, significantly reducing ticket count and human intervention
  • Actively participated in on-call rotations, managing Sev2 incidents and leading cross-functional collaboration to resolve issues by engaging multiple stakeholders across teams
  • Investigated data unavailability incidents and optimized incident resolution to minimize downtime
  • Managed racks across 100+ data centers, coordinating technician access and hardware repairs
  • Used Jarvis for log aggregation, monitoring, and alert correlation
  • Designed custom alerting solutions to proactively detect anomalies and reduce Mean Time to Resolution (MTTR)
  • Led postmortem reviews, applying root cause analysis (RCA) methodologies to drive system improvements
  • Managed Azure Storage capacity, orchestrating customer account migrations and recovery of unhealthy servers
  • Collaborated with multiple teams to monitor Jarvis dashboards, track severity alerts, and ensure proactive mitigations
  • Onboarded and mentored new hires, providing knowledge transfer sessions to enhance team efficiency
  • Influenced cross-functional teams by standardizing automation best practices and fostering a culture of operational excellence

Site Reliability Engineer

Mindtree
Hyderabad
07.2017 - 03.2020
  • Analyzed high-capacity stamps/clusters/tenants to determine the root cause of high capacity usage, and mitigated capacity anomalies by addressing the parameters that triggered them
  • Identified ZRS migrations
  • Worked with Power BI and Jarvis reports
  • Coordinated with Azure datacenter technicians to perform the hardware troubleshooting required to resolve rack-level issues
  • Performed PSU activities, SAS cable replacement, and de-energizing of complete racks
  • Gained hands-on experience with datacenter maintenance, site service teams, PDUs, and blade replacement through vendor coordination
  • Recovered faulty nodes/servers hosted in Azure datacenters using PowerShell through the fabric controller
  • Created and updated TSGs
  • Attended regular team calls with top-level management to discuss team productivity and other metrics, and published the resulting reports
  • Worked on deployment blockers, such as unblocking STG and XOS versions on stamps/clusters/tenants
  • Ensured availability of resources, performed datacenter operations, and resolved or escalated issues within the defined SLA
  • Automated mundane manual activities using PowerShell scripting

Education

B.Tech - Computer Science Engineering

Appa Institute of Engineering and Technology - VTU University
India
01.2013 - 01.2017

Intermediate - MBPC - Maths, Biology, Physics, Chemistry

Shree Guru PU College
India
01.2011 - 01.2013

SSLC - All subjects

St Joseph Convent School Gulbarga
India
01.2010 - 01.2011

Skills

  • Azure Storage

  • Python

  • PowerShell

  • Azure DevOps

  • GitHub Actions

  • Geneva

  • Grafana

  • Azure Logic Apps

  • Data Factory

  • Databricks

  • Understanding of Deep Learning/Machine Learning concepts

Timeline

Service Engineer 2

Microsoft Pvt Ltd
03.2020 - Current

Site Reliability Engineer

Mindtree
07.2017 - 03.2020

B.Tech - Computer Science Engineering

Appa Institute of Engineering and Technology - VTU University
01.2013 - 01.2017

Intermediate - MBPC - Maths, Biology, Physics, Chemistry

Shree Guru PU College
01.2011 - 01.2013

SSLC - All subjects

St Joseph Convent School Gulbarga
01.2010 - 01.2011