Experienced professional with around 3 years of experience in IT service management (Change management and event management ) and Site reliability engineering in retail domain. Open and clear communicator with good multitasking skills, organized nature and strong attention to detail.
- Worked in SmartIT tool for reviewing and approving CRQs.
- Conducting change management trainings to create CRQs.
- Creating templates for the change management module.
- Created successful test scripts to manage automated feature testing.
- Detected and notified on issues for critical applications and environment.
- Partnered with TOC, ITSM and technology teams to deliver comprehensive monitoring solutions utilising best in class monitoring tools.
- Worked on Incident trend on everyday basis and identifying automation opportunities.
- Leveraging monitoring opportunities for new applications and working with automation teams.
- Experience in retail Site reliability for Lowes.com.
- Day to day maintenance of application systems in operation.
- Identifying and troubleshooting application issues.
- Resolving and escalating issues as needed.
- Implementing automation to improve operational efficiency.
- Optimising system performance to enhance reliability.
- Working on various applications on retail side for inventory, reservation and delivery.
- Monitoring grafana dashboard for API and infra level alerts for all the services.
- Analysing ELK logs to investigate on all the events and errors.
- Creating reports to analyze weekly and monthly spikes or dips to further find automation opportunities.
SmartIT tool
Received spot award twice:
ITIL Certification
SRE Foundation Certification
ITIL Certification