

Senior Linux Infrastructure Specialist with 14 years of experience managing and stabilizing large-scale enterprise production environments (1200+ nodes) across hybrid AWS and on-prem datacenters. Deep expertise in Enterprise Linux (RHEL/SLES/Oracle Linux), SAP ASCS/ERS high-availability clustering, performance engineering, infrastructure reliability, and 24x7 production operations leadership.
Proven track record of delivering 99.97% production availability, reducing MTTR by 73%, improving patch compliance to 98%, and leading global infrastructure teams supporting Automotive, Financial Services, Oil & Gas, and Government platforms. Strong SRE-oriented mindset focused on incident elimination, automation-driven operations, monitoring precision, and continuous platform stability improvement.
KEY IMPACT METRICS
Associate Manager – IT Operations & Infrastructure for Automotive & Sap Landscape
May 2021 – Present | Pune, India
Client: Vitesco Technologies (Germany)
Role Overview
Technical and operational lead for hybrid SAP production environments supporting global manufacturing platforms across AWS and on-prem datacenters.
Key Contributions
Linux Infrastructure & Stability Engineering
• Managed 800+ enterprise Linux servers across global production landscape
• Designed and maintained SAP ASCS/ERS HA clusters (Pacemaker/Corosync)
• Diagnosed CPU, memory, and I/O performance bottlenecks using advanced system tools
• Tuned kernel parameters to optimize SAP & middleware workloads
• Resolved AWS resource contention and performance issues
Reliability & Incident Governance
• Reduced MTTR by 73% through structured triage & escalation model
• Reduced repeat P1/P2 incidents by 52% via deep RCA and preventive controls
• Led P1 war-room bridges for critical SAP outages
• Conducted DR simulations validating RTO/RPO compliance
• Improved monitoring precision through alert threshold optimization
Operations Leadership
• Led 12-member global infrastructure team (24x7 operations)
• Designed on-call rotation & escalation matrix
• Published SLA dashboards & availability reports
• Standardized runbooks and operational SOPs
• Coordinated OEM vendors (AWS, SUSE, SAP, Dell, Cisco)
Automation & Compliance
• Automated patching & configuration management via Ansible & Satellite
• Increased patch compliance from 68% to 98%
• Reduced manual operational effort through automation frameworks
• Optimized AWS compute spend by 15%
Environment: AWS, RHEL, SLES, SAP HA, WebLogic, Tomcat, Oracle DB, VMware, Ansible
Client: TotalEnergies (France).
Client: Oppenheimer Funds, Inc. (United States)
(First 6 months worked as a contractor with Teamware Solutions from June 17, 2016, to January 10, 2017.)
Client - CSIR-URDIP Data Center, Pune, India.
Multi-Industry IT Operations (SAP, Automotive, Financial, Government)
IT Infrastructure Management Cloud Operations (AWS, Azure, Hybrid Cloud)
Datacenter Management: Physical Rack, Switch, Cabling, Server Installation, Environmental & Power Management
Datacenter & Migration Management (On-prem → Cloud, P2V, V2V)
SAP Infrastructure Design & Operations on AWS Cloud
High Availability & Cluster Management (Pacemaker / Corosync)
Linux & UNIX Administration (RHEL, SUSE, Oracle Linux, AIX, Solaris)
Virtualization & Backup Management (VMware, PlateSpin, HP Backup)
Automation with Ansible & Shell Scripting, Infrastructure-as-Code
Monitoring with Icinga & Oracle Cloud Control (OEM)
ITIL-based Service Delivery, Change & Problem Management
IT Service Delivery (Incident, Change, Problem, RCA, ITIL Framework, IT Security, Audit & Vendor Management)
Team Leadership Process Optimization Vendor & Stakeholder Management
AWS (EC2, VPC, IAM, S3, CloudWatch, Patch Manager)
Hybrid connectivity design (On-prem ↔ AWS)
DR topology design & RTO/RPO validation
Landing zone & environment standardization
Cloud cost optimization & right-sizing
Capacity planning & performance forecasting
RHEL, SLES, Oracle Linux, Ubuntu
SAP ASCS/ERS clustering (Pacemaker, Corosync)
SAP on AWS architecture
Kernel tuning & performance optimization
Patch governance & vulnerability compliance
EOL modernization strategy
Incident trend analytics
MTTR reduction frameworks
RCA governance models
Observability architecture (CloudWatch, Icinga)
SLA adherence & risk mitigation
Disaster recovery simulations
Ansible automation framework
Shell scripting
Red Hat Satellite / Orcharhino
Server build automation
Change governance & lifecycle management
Operating Systems: RHEL, CentOS, SUSE Linux Enterprise Server (SLES), Oracle Linux, AIX, Windows Server
Cloud Platforms & Monitoring: AWS EC2, S3, IAM, VPC, CloudWatch, Patch Manager, Azure Cloud Services
SAP & Databases: SAP HA Cluster (ASCS/ERS), SAP NetWeaver, SAP HANA, Oracle DB, WebLogic, Tomcat
Clustering & High Availability: Pacemaker, Corosync, SUSE HAE, Red Hat Cluster Suite, PlateSpin DR, DR/Failover setups
Virtualization & Automation: VMware vSphere/ESXi, HA/DRS, VMotion, Ansible, Red Hat Satellite, Orcharhino, Shell Scripting, Server Build Automation
Networking & Security: TCP/IP, VLAN, DNS, NTP, Firewalls (Endian/IPCop, Cisco), Load Balancers, Multi-datacenter connectivity
Storage & Backup: LVM, iSCSI, SAN/NAS/DAS, RAID, HP Storage, PlateSpin Forge, HP MSL Tape Library