Led infrastructure automation, AI-powered analytics integration, and large-scale Kubernetes operations for critical engineering platforms. Specialized in On prem Kubernetes management, CI/CD pipeline optimization, cloud-native automation, and AI/ML-driven insights.
Key Responsibilities & Achievements:
- Bare-Metal Kubernetes & Infrastructure Management:
Administered multi-node Kubernetes clusters, adding nodes, configuring networking, and resolving scheduling bottlenecks for microservices.
Managed VM lifecycle activities (commissioning, decommissioning, migration) across global data centers (BGL, RTP, FUL) including PXE-to-DCAS-11 migration and U32-to-BGL11 data center transition.
Deployed load balancers for RKE2 control nodes to enhance request handling and fault tolerance.
- CI/CD, Automation & DevOps:
Built and maintained complex Dockerfiles (CentOS → Rocky Linux upgrade, PyATS custom images, Cypress integration) optimizing builds and dependencies.
Automated release workflows and performed numerous hotfix/full releases for CIM applications.
Created and maintained Kubernetes manifests, Ansible playbooks, and Terraform configurations for infrastructure provisioning and configuration management.
Developed custom monitoring alerts in Prometheus/Grafana to track infrastructure health, storage (OpenEBS), and application metrics.
- Scripting & Process Optimization:
Wrote Python and shell scripts for operational automation (e.g., log rotation, credential updates, SSL certificate renewal).
Designed troubleshooting scripts for MongoDB performance issues, Selenium container quota enforcement, and network-related debugging.
- AI/ML Contribution – AI-Powered Tableau Dashboard Analysis Bot: Designed and deployed a conversational AI bot integrated with Webex Teams using Python, Flask, Azure OpenAI, and Redis clustering.
Implemented LLM-based function calling with specialized tools to dynamically analyze pipeline performance, defect tracking, and quality metrics.
Achieved 95% accuracy in intent classification, enabling natural language-driven Tableau insights, and reducing manual data retrieval time by 80%.
- Cross-Team Collaboration & Support:
Assisted engineering teams with infrastructure debugging, container access, and CI failures.
Led Bitbucket-to-GitHub migration for TBAAS, including repo restructuring, pipeline updates, and documentation.
Tech Stack: Kubernetes (Bare Metal), Docker, CI/CD (Jenkins, GitHub Actions), Terraform, Ansible, Python, Shell, Azure OpenAI, Tableau REST API, Prometheus, Grafana, Redis, Webex Teams API, MongoDB.