Cloud Operations Engineer with 6 years of comprehensive experience in managing, automating, and supporting large-scale IT infrastructures across AWS, Azure, GCP, VMware [ on-premises data centers]. As a proactive Team Lead, I manage a 20-member team responsible for the end-to-end operations of 200+ diverse customer environments. I excel in Linux administration, infrastructure automation using Ansible and Shell scripting, and containerization with Docker all aligned with SRE principles to ensure operational efficiency, system reliability, and scalable cloud solutions.
I have successfully executed complex projects including storage and network migrations, peering configurations, and system scaling with advanced storage adjustments such as database restore and recovery. Skilled in incident management, alert resolution, and leading major incident bridges, I also act as a critical liaison between management, SRE, and product teams during new customer onboarding. Notably, I spearheaded the development of an Ansible-based AWS patching framework that significantly reduced execution time and enhanced operational efficiency.
This dynamic blend of leadership, technical expertise, and project management underpins my ability to deliver high-impact, reliable cloud solutions that drive business success.
Linux Administration & Operations:
AWS Cloud Operations:
Compute & Instance Management:
Backup & Recovery:
Network & Security:
Storage Management:
Azure Cloud Operations:
Backup & Recovery:
Compute & Migration:
Network Services:
Automation, Git & Docker Operations: