Proven Site Reliability Engineer with expertise in cloud platforms like AWS and Azure, previously at Arka IT Solution. Skilled in optimizing Java applications and enhancing system performance. Demonstrated leadership in cross-functional collaboration, reducing incident resolution time by 30%. Strong communicator with a proactive approach to problem-solving and continuous improvement.
Overview
15
15
years of professional experience
1
1
Certification
Work History
Site Reliability Engineering(SRE)
Arka IT Solution
Irving
05.2024 - 08.2024
Responsible Day to Day Application support activity.
Responsible Morning Health Check Up.
Responsible Application is up and running 24
7.
Responsible C2W if needed when Application service is going down.
Coordinate all build and release activities, ensure release processes is well documented, source control repositories including branching and tagging.
Responsible for Sev-1,Sev-2,Sev-3 and Sev-4 Incident.
Responsible to monitor and available critical service 24
7 production support.
When required to join the C2W call and work with other stack holder team in Application production support environment.
Responsible to Regular production Deployment like as Emergency deployment weekly deployment major release etc.
Deploy the code in cloud platform azure and AWS.
Join the weekly root cause call and discuss the issue to other stack holders.
Monitor the Daily Basis onshore and offshore application around 40-60 application Morning health check up and send the report to other stack holder and management.
Deploy the application to weblogic production server and involve the pre and post deployment issue.
Develop, maintain, and optimize Java-based applications to meet evolving system and business needs. Troubleshoot and debug issues in production, ensuring minimal downtime and disruption to operation.
Provide continuous support for the applications, ensuring optimal performance, security, and compliance with data processing requirements.
Proactively monitor application performance, log files, and infrastructure health using tools like Prometheus, Grafana, or other monitoring solutions. Quickly resolve issues by troubleshooting, analyzing logs, and collaborating with development and operations teams.
Collaborate with cross-functional teams, including business stakeholders, developers, and infrastructure teams, to resolve production issues, deploy new releases, and ensure application reliability.
Engage in performance tuning and optimization of Java applications and database queries, contributing to overall system performance and scalability improvements.
Validate and manage Application Health issues, ensuring that all incidents involving sensitive data are handled according to compliance standards and regulations.
Experience creating pods and clusters in Kubernetes and deploy those using OpenShift. Good understanding of OpenShift platform in managing Docker containers and Kubernetes Clusters.
Worked in container based technologies like Docker, Kubernetes and Openshift.
Point team player on Openshift for creating new Projects, Services for load balancing and adding them to Routes to be accessible from outside, troubleshooting pods through ssh and logs, modification of Build configs, templates, Image streams, etc.
Used Docker, Kubernetes and OpenShift to manage micro services for development of continuous integration and continuous delivery.
Worked on the Deployment, Configuration, Monitoring and Maintenance of OpenShift Container Platform.
Knowledge of monitoring and configuring tools like Introscope, Jenkins, BMC Remedy, Patrol and Netcool.
Handling and generating tickets via the BMC Remedy ticketing tool.
Expertise in Performance Monitoring and Performance Tuning using Top, prstat, sar, vmstat, ps etc.
Experience in Troubleshooting of Network, OS and Performance related problems.
Experience in Solving Memory and CPU utilization problems.
Experience in System Administration, installation of Patches and Troubleshooting related to the same.
Communicated and Coordinated with customers internal/external for resolving issues for reducing downtime.
Good knowledge of Change Management process.
Experience in installing software's packages like FTP, DNS using Red hat Package Manager.
Supported after-hours to meet deadlines as well as for operational support as required.
Responsibilities to solve technical Problems related System administration (Linux of Our Clients). Maintaining and Troubleshooting of FTP Server, Samba Server of the client.
Coordinating with apps & database teams to apply patching, Managing SAN environment from the Linux point of view, managing Physical and Logical volumes.
Handling the day-to-day Operations, Install software, apply patches, manage file systems, monitoring performance and troubleshoot alerts.
Leading and driving of technical conference bridges, Root Cause determination, Severity Level Assessment, Change management and report directly with Executives, Senior Sustain Managers, and all other stakeholders.
To make sure the all Critical and Major Incidents are addressed within SLA response and sending out the Critical Alerts for such Incidents.
Ensured to be able to successfully bring down the MTTR (Mean time to resolve) for all categories of incidents.
To manage and support all service incidents either personally or via the Service Desk, through to successful completions and user satisfaction.
To regularly review, performance and trends in response to incidents and to provide recommendations to the Service Manager for service improvement. The emphasis is on swift resolution of critical incidents, which have severe business implication or have the potential of causing disruptions / unavailability.
Sr. Application Support Analyst
Citi Bank - Ltimindtree
TX
09.2022 - 04.2024
Responsible for day to day technical issues of 140+ clients.
Identified system data issued using oracle, sql, , xml, java client Citi Bank API Application.
Participate in root causes analysis for service failure and provide resolution.
Provides senior level system analysis, design, development, and implementattion of applications and databases for client/server-, Web-, and/or PC-based software or middleware.
Provide technical support for our Client Citi Bank application to triage, resolve, and conduct RCA on Sev 1 and Sev 2 incidents.
Interpret technical designs for implementations, or review those created by team members, to deduce why production issues may be occurring.
For issues that require coding changes, develop and technical development that is in line with our established architecture, technical designs, and development standards.
Act as the primary prod support technical contact for our application end users.
Understand the overall product roadmap as articulated by agile coach/product owner and translate roadmap into team specific release planning and sprint planning.
Correct the existing code if required which should be well designed, testable, efficient code.
Monitoring and Responsible to available the critical service 24
7.
Experience creating pods and clusters in Kubernetes and deploy those using OpenShift. Good understanding of OpenShift platform in managing Docker containers and Kubernetes Clusters.
Worked in container based technologies like Docker, Kubernetes and Openshift.
Point team player on Openshift for creating new Projects, Services for load balancing and adding them to Routes to be accessible from outside, troubleshooting pods through ssh and logs, modification of Build configs, templates, Image streams, etc.
Used Docker, Kubernetes and OpenShift to manage micro services for development of continuous integration and continuous delivery.
Worked on the Deployment, Configuration, Monitoring and Maintenance of OpenShift Container Platform.
Knowledge of monitoring and configuring tools like Introscope, Jenkins, BMC Remedy, Patrol and Netcool.
Handling and generating tickets via the BMC Remedy ticketing tool.
Expertise in Performance Monitoring and Performance Tuning using Top, prstat, sar, vmstat, ps etc.
Experience in Troubleshooting of Network, OS and Performance related problems.
Experience in Solving Memory and CPU utilization problems.
Experience in System Administration, installation of Patches and Troubleshooting related to the same.
Communicated and Coordinated with customers internal/external for resolving issues for reducing downtime.
Good knowledge of Change Management process.
Experience in installing software's packages like FTP, DNS using Red hat Package Manager.
Supported after-hours to meet deadlines as well as for operational support as required.
Responsibilities to solve technical Problems related System administration (Linux of Our Clients). Maintaining and Troubleshooting of FTP Server, Samba Server of the client.
Coordinating with apps & database teams to apply patching, Managing SAN environment from the Linux point of view, managing Physical and Logical volumes.
Handling the day-to-day Operations, Install software, apply patches, manage file systems, monitoring performance and troubleshoot alerts.
Responsible for Money transfer systeam and payment support.
Track the fund related issue if user is face like as uber payment support, cross border payment support, UETR payment issue, swift related issue.
Associate Tech Specialist
AT&T Tech Mahindra Ltd.
Noida
04.2015 - 09.2022
Building the CI/CD process from scratch.
Implementing gitlab CI, gitlab, docker, maven.
Migrating from gitlab to docker and implementing gitlab inside docker.
Containerizing the integration process by gitlab CI within docker.
Used Docker containers to quickly deploy linux based applications.
Good Interpersonal Skills, team-working attitude, takes initiatives and very proactive in solving problems and providing best solutions.
Worked in all areas of Jenkins setting up CI for new branches, build automation, plugin management and securing Jenkins and setting up master/slave configurations.
Integrating various Version control tools, build tools, nexus and deployment methodologies (scripting) into Jenkins to create an end to end orchestration build cycles.
Troubleshoot build issues in Jenkins, performance and generating metrics on master's performance along with jobs usage.
Design, develop build and packaging tools for continuous integration build and reportting. Automate the build and release cycles.
Coordinate all build and release activities, ensure release processes is well documented, source control repositories including branching and tagging.
Responsible for Sev-1,Sev-2,Sev-3 and Sev-4 Incident.
Responsible to monitor and available critical service 24
7 production support.
When required to join the C2W call and work with other stack holder team in Application production support environment.
Responsible to Regular production Deployment like as Emergency deployment weekly deployment major release etc.
Deploy the code in cloud platform azure and AWS.
Join the weekly root cause call and discuss the issue to other stack holders.
Monitor the Daily Basis onshore and offshore application around 40-60 application Morning health check up and send the report to other stack holder and management.
Deployed the application to weblogic production server and involve the pre and post deployment issue.
Develop, maintain, and optimize Java-based applications to meet evolving system and business needs. Troubleshoot and debug issues in production, ensuring minimal downtime and disruption to operation.
Provide continuous support for the applications, ensuring optimal performance, security, and compliance with data processing requirements.
Proactively monitor application performance, log files, and infrastructure health using tools like Prometheus, Grafana, or other monitoring solutions. Quickly resolve issues by troubleshooting, analyzing logs, and collaborating with development and operations teams.
Collaborate with cross-functional teams, including business stakeholders, developers, and infrastructure teams, to resolve production issues, deploy new releases, and ensure application reliability.
Engage in performance tuning and optimization of Java applications and database queries, contributing to overall system performance and scalability improvements.
Leverage DevOps tools and methodologies for continuous integration and continuous deployment (CI/CD). Automate build, test, and deployment pipelines to support application delivery.
Support and troubleshoot Postgres SQL and MS SQL Server databases, ensuring data integrity, optimization, and availability.
Oversee and manage the ticket workflow for incidents, especially Priority 4 & 5, ensuring proper tracking, followup, and resolutions within established SLAs.
Initiate and drive T (Technical Operations Calls) when outages occur, prioritizing critical incidents (Priority 1, 2 & 3) to ensure prompt resolution and minimal service disruption.
Manage SSL Certificate Management, ensuring timely renewal and proper configuration to maintain secure communications.
Ensure the operational readiness of systems and processes, particularly during peak periods (e.g., 1/1 or other high-traffic times), to ensure that infrastructure supports increased loads and system reliability.
Assist in the creation and maintenance of Application Recovery Guides (ARGs) and conduct disaster recovery (DR) activities to ensure business continuity in case of major system failures.
Validate and manage Application Health issues, ensuring that all incidents involving sensitive data are handled according to compliance standards and regulations.
Experience creating pods and clusters in Kubernetes and deploy those using OpenShift. Good understanding of OpenShift platform in managing Docker containers and Kubernetes Clusters.
Used Docker, Kubernetes and OpenShift to manage micro services for development of continuous integration and continuous delivery.
Worked on the Deployment, Configuration, Monitoring and Maintenance of OpenShift Container Platform.
Knowledge of monitoring and configuring tools like Introscope, Jenkins, BMC Remedy, Patrol and Netcool.
Handling and generating tickets via the BMC Remedy ticketing tool.
Expertise in Performance Monitoring and Performance Tuning using Top, prstat, sar, vmstat, ps etc.
Experience in Troubleshooting of Network, OS and Performance related problems.
Experience in Solving Memory and CPU utilization problems.
Experience in System Administration, installation of Patches and Troubleshooting related to the same.
Communicated and Coordinated with customers internal/external for resolving issues for reducing downtime.
Good knowledge of Change Management process.
Experience in installing software's packages like FTP, DNS using Red hat Package Manager.
Supported after-hours to meet deadlines as well as for operational support as required.
Responsibilities to solve technical Problems related System administration (Linux of Our Clients). Maintaining and Troubleshooting of FTP Server, Samba Server of the client.
Coordinating with apps & database teams to apply patching, Managing SAN environment from the Linux point of view, managing Physical and Logical volumes.
Handling the day-to-day Operations, Install software, apply patches, manage file systems, monitoring performance and troubleshoot alerts.
System Analyst
AT&T - Tech Mahindra Ltd.
IND
12.2013 - 03.2015
Extensively configured and administered Weblogic Server 9/10 in various LINUX Environments.
Worked with Development, Testing, Staging and Production environments.
Created Weblogic domains, Managed servers, Clusters, Machines and Start up scripts.
Configured JDBC Data Sources that bounded to the J2EE Applications, configured the connection pools for the data sources.
Involved in doing a performance benchmark of Weblogic server by using Load runner.
Created WLST, ANT scripts, and shell scripts to automate the deployment process.
Configured Node manager to remotely administer Managed servers.
Involved in creating and configuring the Clustered platform domain for load balancing and fail over.
Experience on providing maintenance support of Oracle Identity Manager.
Working knowledge on Oracle Identity Manager 9g/11g provisioning and request workflows.
Involved in developing controls like EJB Controls, JMS Controls, AI controls and DB Controls by using WebLogic Workshop IDE.
Managed and Monitored JVM performance by Weblogic Heap Size, garbage collection, JDBC Pools.
Developed many shell scripts to automate the maintenance process of the Weblogic and recover the backed up WebLogic configurations.
Control user access and permission through Security Realms from Administration Console.
Used Pack and Unpack Commands and created Templates.
Installed and configured WebLogic, Apache and Sun One Web Servers during the Data Center Server migration.
Installed and Configured JBoss 6.0 on Dev, Test, PPE and Prod Environments and Provided support.
Configured Connection Pools in JBoss and monitored them.
Configured JBoss Server and provided support for the ERMS application.
Extensively work on integration with oracle access manager NetIQ, Oracle directory server(LDAP), weblogic credential store, Jazn Security Policy designed weblogic monitoring framework and migrations strategy using wlst, ADF and Java.
Have good Knowledge in troubleshooting various issue to cyberark.
Experience with installation and configuration cyberark vault cpm, cyberark, pvwa etc.
Good knowledge in Ldap and involved with ldap integration and adding user with their privileges.
Senior programmer
IGNOU(Indra Gandhi National Open University)
New Delhi
01.2011 - 12.2013
Company Overview: URL :- http://edrp.ac.in
Web Hosting Management System
Technology :- Shell Script, RPMS,Java Script
RDBMS :- mysql
Server :- Red Hat Entrprise server Tomcat 5.5
Objective: Web Hosting Management system (WHMS) is a web based hosting service that allows multiple DNS, Domains, mail server access single account users and organization to create their own website accessible via the World Wide Web. A common Internet service is web hosting. Web hosting means store your website on a public server.
WHMS web hosting allows users to manage different aspects of website, including our files, security, email, web applications.
URL :- http://edrp.ac.in
Proficiency on cPanel/WHM both frontend and backend.
Third Party Scripts and softwares Installation and Configuration.
Apache and MYSQL Server optimization.
Troubleshooting problems related to Apache MYSQL, PHP, Exim, DNS, FTP etc.
Apache compilations.
Network Firewall installation and configurations- CSF, APF, IPTABLES.
Objective: eGyanKosh is an online Repository of content management system project. In this it maintains all Communitities and Collections with its items. It provides better searching techniques and security.
UR :- http://www.egyankosh.ac.in
Involved in the design of the applications using J2EE. This architecture employs a Model/View/Controller (MVC) design pattern.
Implemented MVC architecture using Struts in terms of JSP and Servlets.
Responsibilities involved developing of Action Classes, Form Beans and JSPs.
Written Enterprise Java Beans (EJB) to implement business logic.
Development entails usage of J2EE technologies like Struts, JDBC and JBossApplication Server.
Written JavaScript for validation of page data in the JSP pages.
jr.software programmer
BSA Info media Pvt. Ltd.
New Delhi
01.2010 - 01.2011
BSA Info media is work on various process i.e. vodaphone verification aircell verification, citi bank credit card collection Barclays collection abnamro collection etc.
I am handling the online web base application on sql server 2005 and front end visual studio 2008. We are maintained and modify web base application on client demand.
Support a web application on user base.
Education
M.C.A - computer Application
Annamalai University
India
01.2007
Doeacc 'A' Level -
Affiliated Institute ET&T
New Delhi
03.2005
B.Com - Bachelor in commerce
Delhi University
India
01.2003
Doeacc 'O' Level -
Affiliated Institute ET&T
New Derlhi
07.2002
Skills
Build Tools Ant
Build Tools Maven
Container run times Docker
Container run times Kubernetes
Cloud Platforms AWS
Cloud platforms OpenShift
Cloud Platforms Azure
Web Application Servers Tomcat
Web application servers WebLogic
Web Application Servers Web Sphere Application Server 8
Functional Consultant – SCM at • ARKA Technology Innovation & Software SystemsFunctional Consultant – SCM at • ARKA Technology Innovation & Software Systems