Proactive and detail-oriented Site Reliability Engineer with 4+ years of experience in managing infrastructure reliability, incident response, system monitoring, and automation. Adept at working in high-pressure 24x7 production environments, on-call rotations, and root cause analysis of critical incidents. Experienced with monitoring tools like Icinga, Grafana, and AppDynamics, and process automation using PowerShell scripting. Passionate about driving stability, reducing manual tasks, and transitioning into a more DevOps-focused role.
- Led a seven-member team internally in production support operations, ensuring system uptime through proactive monitoring and issue resolution.
- Collaborated with the DBA team to automate a manual payment processing checklist, saving approximately 7,300 minutes per year in effort.
- Monitored critical systems using Grafana, Icinga, AppDynamics, Control-M, and Supervisor-D; provided patch clearance, and system health updates.
- Took ownership of P1/P2 incidents, executed rapid triaging and RCA, and ensured zero business impact during on-call rotations.
- Maintained documentation, tracked anomalies (e.g., UPS, NetBotz), and coordinated SNMP configurations with vendors and clients.
- Reduced incident noise by effectively managing parent-child incident tagging to group false alerts.
- Wrote PowerShell scripts for basic infrastructure management tasks, and participated in daily Scrum to report and plan workloads.
- Supported the successful go-live of the Carlyle project, actively participating in initial planning and deployment coordination.
Operating Systems: Window, Linux
Monitoring & Alerting: Grafana, Icinga, AppDynamics, Control-M, Supervisor-D
Ticketing & ITSM: ServiceNow, OpsGenie, JIRA
Virtualization: VMware vSphere, Citrix, RDP
Cloud Platforms: Windows VM (hosted via VMware/Citrix)
RDBMS: MS SQL Server, SSMS
Programming Language: C, C, Python, Java, SQL
Scripting languages: Bootstrap, CSS, PHP, PowerShell, SQL
Process: System Monitoring, Incident Management, ITIL framework, Capacity planning, RCA, On-Call Rotation, Daily Standups
Other Utility Tools: Putty, WinSCP
Soft Skills: Leadership, Cross-functional collaboration, Crisis handling, Communication
Microsoft Azure Fundamentals - AZ900