Results-driven IT professional with 7 years of overall experience, including 4 years in Production Support within the banking and financial services domain. Skilled in ensuring high availability and stability of mission-critical applications, with hands-on expertise in incident management, monitoring, troubleshooting, and root cause analysis. Experienced in working with tools such as Grafana, Kibana, AppDynamics, ServiceNow, Jira, and Confluence. Proficient in SQL troubleshooting, Linux-based application support, and Kubernetes environment management. Adept at collaborating with cross-functional teams to resolve complex issues, improve system performance, and deliver a seamless customer experience. Known for strong problem-solving skills, attention to detail, and the ability to perform under pressure in production environments.
Project: Banking Domain (First Citizens Bank)
Provided L2/L3 production support for core banking applications, ensuring uninterrupted functioning of NEFT, IMPS, RTGS transfers, customer account services, and transaction APIs.
Participated in daily stand-up calls, giving progress updates on incident resolution, monitoring alerts, and assigned tasks.
Monitored application health through Grafana, Kibana, and AppDynamics dashboards:
Grafana → Response time, CPU, memory, uptime, and system performance metrics.
Kibana → Application log analysis for errors and root cause identification.
AppDynamics → End-to-end transaction monitoring and identification of performance bottlenecks.
Proactively monitored and resolved alerts related to CPU utilization, memory usage, disk space, and API error rates.
Performed application and server troubleshooting in Linux-based environments, including log analysis, process health checks, and server availability validation.
Investigated Kubernetes pod failures (CrashLoopBackOff), restored service availability, and implemented monitoring enhancements.
Diagnosed and resolved SQL blocking and deadlock issues in MS SQL Server by analyzing query performance and coordinating with DBAs for optimizations.
Validated post-resolution system stability and SLA compliance using Postman API testing.
Used ServiceNow for incident, problem, and change management in production environments.
Used Jira for bug/issue tracking, ticket assignment to developers, and progress tracking of production defects.
Maintained SOPs, runbooks, and RCA documentation in Confluence for knowledge sharing and operational readiness.
Collaborated with the DBA, DevOps, and Development teams to drive long-term fixes and performance improvements.
Project: Consumer & Community Banking (ATM Reconciliation Services)
Operating Systems: Linux, Windows
I hereby declare that the above-mentioned details are correct to the best of my knowledge, and I bear responsibility for the correctness of the above-mentioned particulars.
SHAIK IRFAN BASHA