Adept System Engineer with a proven track record at TESCO Hindustan Service Center, enhancing infrastructure efficiency and team productivity. Excelled in deploying and managing cloud solutions across AWS and Azure, leveraging skills in Kubernetes and Terraform. Demonstrated leadership in guiding teams through complex migrations, achieving significant cost optimizations and performance improvements.
Overview
17
17
years of professional experience
1
1
Certification
3
3
Languages
Work History
System Engineer 2
TESCO Hindustan Service Center
05.2012 - Current
Responsible for Design, implement and provide Operational support of infrastructure and CI/CD pipelines that enables development teams to develop, run and support their APIs.
Manage self-hosted Kubernetes (with istio service mess), ElasticSearch, Couchbase, Redis, Kafka in AWS Cloud. Also managed AKS, Azure redis cache, container registry, key vaults, etc in Azure cloud. We do use managed Confluent cloud for Kafka eventing service.
Using Nexus and GitHub as a code hosting platform for version control and collaboration
Managing CI/CD pipelines using Jenkins and Ansible (AWS hosted application deployments) & Azure DevOps (App deployment and infra provisioning using Terraform)
Implementing monitoring and observability solutions (Such as Runscope, Splunk, AppDynamics, NewRelic, Prometheus and Grafana) for Infrastructure and Applications hosted in cloud.
Manage cost optimisation of infrastructure hosted in public cloud such as AWS and Azure.
Handling and supporting AWS to Azure data and applications migration as part of our strategic solutions.
Using JIRA Scrum & Kanban tools to help teams working on complex projects, collaborate and better structure and manage their workload.
Handling Incident, change and Problem Management.
Supporting in collaboration with Business and Development teams to ensure consistent high levels of functional and performance capacities.
Implementing and manage Infrastructure and Application monitoring, automation of routine monitoring tasks, analysis of system health.
Planning and implementing Live load test on our application and infrastructure to make sure we can cope up with our most yearly peak at Christmas and Thanksgiving.
Handling CAB (change advisory board) for any changes goes live in production.
Conducting weekly service review meetings with higher management and UK stake holders to discuss previous week status (SLA, KPIs, Performance, etc.)
Manage Application support team for UK GHS in production environment 24X7.
Manage team shift Rota, leave plans, and resource allocation for live releases and patching activities etc.
Handling monthly patching activity across our Linux and Windows production servers at enterprise level
Reporting quality metrics of the performing Delivery Centers to Senior Leadership and follow up on service improvement areas.
L2 Server Support Engineer
Replicon Software Pvt Ltd
02.2010 - 03.2012
Manage SaaS (Software as a Service) Infrastructure availability and performance 24/7.
Remote management Windows and Linux servers.
Providing Active directory & DNS support.
Monitor and Manage Datacenter Hosted in Canada, UK, and Australia
Working on Salesforce cloud for CRM (Customer Relationship Management)
Monitor and Manage Software Load Balancer installed in Linux CentOS.
Monitor and Manage HyperV servers through NIM tool.
Deploy MS windows updates and Patch to different Datacenters.
Deploy new Product releases to different Datacenters.
Log Analysis – Windows & Linux Server.
Ensure highest uptime for customers in SaaS environment.
Use SQL server 2005/2008 as a backend for customer Database storage.
Use Web Server (IIS 6/7) to setup and deploy WR demo/Production Website.
Fetching SQL DB Backups using Symantec Backup tool.
Troubleshoot system/app issues on db and app servers.
Deploy application customizations on customer demands.
Disable and clean up canceled instances and provide data backups to customers.
Coordinate scheduled upgrades with customers.
Streamlined support ticket resolution through effective communication and collaboration with team members
Conducted periodic system audits to identify areas of concern or inefficiency, recommending appropriate actions for improvement as needed
IT Operation Engineer
IBM Network Solution Pvt Ltd
10.2007 - 02.2010
Monitoring and managing windows and Linux servers in an AD Environment
Trouble shooting and monitoring servers for availability/ performance through a network management software called "SNAPPiMON" integrated with HP open view service desk 4.5 for automated ticket generation.
Installation Windows 2000/2003 and Linux Servers.
User Management in Active Directory (Creation, Deletion, Add in Different Group)
Setup and configure new workstations, software installation, and network configuration for internal users.
Remote management Windows and Linux servers.
Monitoring Service Desk (HP Open view SD) application
Installation of Lotus Notes, Configuration and troubleshooting the problems.
Provide the Domain configuration supports
Incident, Change & Problem management
Log Analysis – Windows & Linux
Monitoring servers with the help of monitoring tool
Backup jobs Monitoring SQL server
Password Management – Windows
SOM (SMS application-Used for sending sms and e-mail alerts)-Creation/deletion of users, groups, and department
User ID / Mail ID creation, deletion and modification
Creating and troubleshooting user Account/profile
Education
MBA - Management Information Systems
Alliance University
Bengaluru, India
04.2001 -
Skills
Operating Systems: RHEL, Centos, Ubuntu & Windows 2008/2012/2016/2019 servers