Results-driven SRE L1 Support Engineer with experience in monitoring and incident management at Photon Interactive Pvt. Ltd. Proficient in OpsGenie and SQL, with a track record of resolving production issues and improving service reliability. Strong analytical skills and close collaboration with DevOps teams have helped improve system performance and uptime.
Overview
3
years of professional experience
Work History
SRE L1 Support Engineer
Photon Interactive Pvt. Ltd.
Bangalore
09.2019 - 10.2021
Monitoring & Incident Management: Actively monitored system health, infrastructure alerts, and application dashboards using tools like CloudWatch, Grafana, Nagios, Prometheus, or Datadog.
Responded to alerts and incidents, performed initial investigation, and escalated to L2/L3 teams when required.
First-Level Troubleshooting: Handled day-to-day issues with servers, databases, and applications (e.g., restarting services, checking logs, verifying connectivity).
Resolved recurring user-reported issues by following runbooks/standard operating procedures (SOPs).
Ticket Handling & Support: Managed incoming tickets related to infrastructure, deployments, and system performance.
Ensured SLAs were met by prioritizing critical incidents and providing timely updates.
System Operations: Performed routine checks on Linux/Unix servers, AWS resources, and databases to ensure availability.
Assisted with scheduled maintenance activities like patching, backups, and verifying system health post-deployment.
Collaboration & Documentation: Escalated issues with detailed troubleshooting notes for faster resolution by senior engineers.
Updated knowledge base, runbooks, and incident reports for future reference.
Learning & Growth: Gained exposure to DevOps/SRE practices like automation, CI/CD pipelines, and cloud resource monitoring.
Assisted in automating repetitive support tasks under guidance from senior SREs.
Associate Software Engineer
Honeywell Technology Pvt. Ltd.
Bangalore
11.2018 - 06.2019
Prepared daily production reports and shared them with stakeholders.
Integrated and acknowledged alerts using OpsGenie and monitored performance alerts in AppDynamics.
Analyzed log files to identify errors, performance issues, and root causes.
Worked closely with DevOps & DBA teams for incident resolution and performance optimization.
Extracted and validated KPI data from databases for availability and performance tracking.
Calculated availability KPIs and related metrics to measure service reliability.
Generated Service Availability Reports using Dexter and shared insights with management.