Google Certified Solution Architect and software professional with expertise in AI, MLOps, DevOps, cloud computing, and system design. Skilled in Kubernetes, Terraform, Google Cloud, AWS, and AI-driven DevOps automation. Passionate about enhancing system observability, optimizing deployments, and leveraging AI for intelligent automation
* Designed and implemented multi-tenant AI/ML architectures on Kubernetes, enabling selective deployments across thousands of tenants.
* Developed AI agentic frameworks for anomaly detection, automated troubleshooting, and performance optimization of application.
* Built an LLM-powered chatbot for troubleshooting and automating responses to common issues, enhancing customer experience through AI, automation, and NLP.
* Led SonarQube integration into CI/CD pipelines & Github Actions, significantly improving code quality and security.
* Migrated tech stack components, including Grafana On-Prem to Grafana Cloud, and executed major upgrades for Artifactory and MySQL.
* Automated certificate renewals using Python, Ansible, and Terraform, reducing manual effort and operational risk.
* Enhanced system observability, reducing P1/P2 incidents by 70% through AI-driven monitoring, and predictive analytics.
* Mentored cross-functional teams, serving as a subject matter expert in cloud infrastructure, AI automation, and DevOps.
* Developed IaC automated deployment frameworks using Python, Ansible, Terraform, and Kubernetes, streamlining infrastructure provisioning.
* Implemented AI-powered self-healing automation for the postpaid stack, covering 81% of system failures, and improving resilience.
* Designed and optimized monitoring dashboards using ELK, Grafana, and Prometheus for enhanced observability.
* Automated repetitive tasks through custom scripts significantly reduce manual effort and improve application efficiency.
* Enhanced CI/CD pipelines by integrating Bitbucket, SonarQube, and Jenkins, improving deployment speed and code quality.
* Led and mentored teams, ensuring knowledge transfer and best practices in infrastructure automation and DevOps.
* Containerization & Automation: Automated Docker container provisioning for seamless and efficient application deployment.
* Cloud Infrastructure Management: Designed, deployed, and managed AWS cloud infrastructure, ensuring scalability and reliability.
* Scalable Application Development: Built robust application components using Python, enhancing performance and maintainability.
* Monitoring & Debugging: Implemented ELK Stack, Prometheus, and Grafana for real-time monitoring, logging, and issue resolution.
* CI/CD Pipeline Optimization: Designed and managed end-to-end CI/CD pipelines for automated builds, testing, and deployments.
* Customer Issue Resolution: Diagnosed and resolved customer-reported issues, addressing defects and closing technical knowledge gaps.
* Task Automation: Automated manual tasks using Shell scripting and Python, enhancing operational efficiency.
* Telecom Fraud Monitoring: Developed and maintained rule-based systems to monitor and detect telecom fraud.
* Defect Analysis & Issue Resolution: Analyzed and resolved defects related to customer-reported issues, ensuring optimal service delivery.
* CI/CD Pipeline Setup: Designed and implemented CI/CD pipelines to streamline deployment processes and improve efficiency.
* Customer Issue Resolution: Handled and resolved escalated customer issues professionally, employing conflict resolution techniques as needed.
* Operational Efficiency Improvements: Introduced and executed strategies to enhance the efficiency of service delivery operations.