
Site Reliability Engineer with 7+ years of experience in cloud-native and hybrid infrastructures. Expertise in building, monitoring, and optimizing resilient systems using Red Hat OpenShift, Azure, and Kubernetes. Proven ability to diagnose production issues and implement observability best practices, ensuring high availability and security compliance.
· Deliver L2 operations support for the Payments platform including issue investigation, escalation, and resolution in alignment with SLA and compliance requirements.
· Apply SRE principles and techniques such as automation, observability, and continuous improvement to enhance reliability and operational efficiency of payment services.
· Monitor application health and performance through proactive alerting, dashboards, and logs to ensure high availability and quick response to production issues.
· Manage incidents end-to-end — including triage, root-cause analysis, coordination with engineering teams, and communication of impact and recovery progress.
· Contribute to resilience improvements by identifying repeated failures, optimizing runbooks, and implementing preventative measures to reduce operational risk.
• Drive day-to-day BAU operations across Red Hat Openshift, PCF, and Azure, ensuring platform stability, security, and optimal performance
• Spearhead design and deployment of OpenShift clusters in both bare-metal
and VM-based environments
• Lead application workload migration from PCF to OpenShift, ensuring seamless transition with minimal downtime
• Manage platform resources on OCP including Persistent Volume creation, role- based access modifications, and Operator installations
• Partner with the Observability team to develop detailed Grafana dashboards for real-time platform monitoring
• Build and maintain Splunk dashboards for proactive health checks and alerting
• Perform PCF foundation upgrades and Diego cell scaling to improve system performance
• Deliver Kubernetes-based platform support managing configurations and incident triage
• Collaborate with cross-functional teams to onboard workloads into hybrid cloud environments
• Monitor issues flagged by down detectors, engaging in troubleshooting sessions
• Provided L3 support for environment and production-related issues
• Analyzed code and made modifications to existing software
• Handled emergency recovery support interacting with clients
• Worked on script filling/refilling, adjudication of scripts, and drug utilization review functionalities
• Created alerts in Splunk and AppDynamics to support application availability
• Led a team of 4 in providing system support and troubleshooting
• Analyzed issues caused by major events with SRE techniques
• Developed complex SQL queries for report generation
• Created metrics dashboard and alerts in AKS
• Migrated applications from on-premises to Azure
• Successfully migrated Windows and Linux Servers from on-premises to Azure
• Executed migration of SQL Servers from on-premises to Azure
• Managed and troubleshot replication and migration issues
• Conducted discovery and assessment of on-premises servers
• Played a role in the construction and configuration of Virtual Machines
• Created and implemented VNETs and NSG rules
• Contributed to Identity and Access Management
• Managed traffic at the server level to ensure optimal performance
• Implemented scalable software solutions using agile methodologies
• Collaborated with cross-functional teams for feature design and deployment
• Conducted code reviews and debugging sessions
• Successfully migrated applications to the Cloud and deployed them through PCF
• Led design and migration of applications to Microservices
• Facilitated Scrum calls for project updates and coordination
• Provided L3 Support for Environment and Production
• Implemented Shell scripts for production maintenance
• Led a team of junior software engineers in successful project delivery
Star of the Quarter Star of the Month Pat on the Back
Awarded Star of the Quarter for standout performance in System Analyst role and building strong customer relationships
Received Star of the Month twice for outstanding performance and technical analysis on critical issues appreciated by clients
Continuously awarded Pat on the Back for hard work, diligence, and inspiration to the team