Summary
Overview
Work History
Education
Skills
Publications
Accomplishments
Certification
Timeline
Generic

Abhishek Kanniganti

Senior AI Engineer
Hyderabad

Summary

AI professional with over 8 years of experience in conceptualizing, designing, and integrating complex AI-driven solutions across industries. Demonstrated expertise in building end-to-end AI/ML solutions for engineering challenges, NLP/Generative AI applications, and backend systems using Python. Skilled in deploying scalable ML pipelines and integrating LLMs such as GPT-4, LLaMA, and NVIDIA models into production environments. Passionate about innovation and leveraging AI to solve real-world problems

Overview

8
8
years of professional experience
4
4
Certifications

Work History

Senior AI Engineer

Deloitte
05.2021 - Current

Led a team of 7 AI Engineers in delivering AI-driven solutions across diverse industries. Experienced in the full lifecycle of machine learning projects—from use-case identification and model development to deployment and productionization.


Projects:

  • Supply Chain Modernization for Leading Laptop Manufacturer: Led a team of 10 engineers to develop an automated low-code LLMOps and MLOps platform, streamlining the deployment of AI and GenAI solutions. Integrated frameworks such as Langfuse and MLflow, and implemented CI/CD pipelines with Kubernetes-based deployments, to accelerate onboarding. Successfully reduced the average productionization time for supply chain AI use cases from 2 months to 1 month. Incorporated robust features, including Automated Agent Evaluation, Drift Monitoring, and LLM Observability, to enhance system reliability and maintainability.
  • First Draft Response Automation: Developed an advanced RAG and NL2SQL-based response generation asset, leveraging large language models to reduce the efforts required by analysts for writing RFP responses. Additionally, I created a robust pipeline to extract data from PDFs, generate outlines, win themes, and past success stories for a given RFP through LLMs like GPT-4 and Mistral, and perform embedding searches over the knowledge base of unstructured PDFs and a structured database. Created a comprehensive evaluation strategy for evaluating and benchmarking various components of the solution. The application has been productionized with dashboards for token usage, GPU utilization, tracing, and logging, and is currently used by three teams. Its knowledge base has been scaled to over 1,000 documents.
  • Readmission Prediction: Finetuned LLama 3.1 70B-based model to predict patient readmission based on clinical notes for a large hospital in New York. Achieved 75% accuracy on the test data and deployed the model on VLLM after performing various inference optimizations using the Nvidia Nemo framework.
  • Converter Failure Prediction for Wind Turbines: Led a team of three analysts to develop a predictive maintenance solution to identify converter failures in advance. Analyzed data from 17 sensors and extracted key patterns, resulting in converter failure. Developed explainable models to predict failures in converters in advance. As of the first phase, we achieved a 30% accuracy lift in detecting converter failures and a 60% reduction in false alarms compared to their existing systems.
  • Hot Socket Prediction: Developed an ML model using Auto-Encoder and XGBoost to predict high-temperature events, or hot-socket events, in smart meters in advance, and help businesses prevent emergencies resulting from them. Built data processing pipelines leveraging various distributed frameworks like Pyspark, Dask, Rapids to efficiently process 120 TB of sensor data on a GPU cluster. The model is deployed in production and evaluates the sensor data of 4.3 million smart meters in a city and gives the business a list of meters which are going to undergo hot-sockets. The event is a rare event, occurs in about 250 meters out of 4.3 million meters over the period of one year. The model captured 80% of hot-sockets and has a hit-rate of 55%.

AI Engineer

IMI Mobile (Acquired by Cisco)
06.2017 - 05.2021
  • Worked on developing IMIBot’s primary NLP engine and other internal tools by enhancing an open-source library to transform IMIBot from code-based to GUI-driven. This has led to reduction in time taken to build a bot by 80 %.
  • Enhanced the IMIBot platform to handle more than 100K messages per year, and it has improved the accuracy of bots by 10% compared to a rule-based engine.

Education

OMSCS - Computer Science

Georgia Tech
Online
04.2001 -

BE (Hons) - Electronics And Communication Engineering

BITS Pilani
Hyderabad, India
04.2001 -

Skills

    Programming Languages: Python, R, MATLAB

    Machine Learning : PyTorch, Keras, Scikit-learn, OpenCV

    Gen AI: Transformers, Langchain, Hugging Face, GPT-4, Llama, Nvidia LLMs, LangGraph, LangFuse, NVIDA Nemo

    Distributed Frameworks: Spark, Dask, RAPIDS,

    MLOps/DevOps: Kubernetes, Docker, CI/CD, AWS, Airflow, Grafana

Publications

  • Fault Tolerant Topology Generation for Application-Specific NoC – IEEE ISED 2017
  • Optimizing Human-Machine Assignments – HCOMP Journal 2015

Accomplishments

  • Outstanding Performance: Feb 2023, Jun 2023, Oct 2024, Feb 2025,
  • Applause Award: Feb 2022, Oct 2023

Certification

Applications of AI Predictive maintenance - NVIDIA DLI

Timeline

Nvidia LLM Developer Certification – NVIDIA DLI

04-2025

Tensorflow Developer Certification - Tensorflow

03-2024

Applications of AI Predictive maintenance - NVIDIA DLI

11-2022

Recommendation engine using NVIDIA Merlin - NVIDIA DLI

05-2022

Senior AI Engineer

Deloitte
05.2021 - Current

AI Engineer

IMI Mobile (Acquired by Cisco)
06.2017 - 05.2021

OMSCS - Computer Science

Georgia Tech
04.2001 -

BE (Hons) - Electronics And Communication Engineering

BITS Pilani
04.2001 -
Abhishek KannigantiSenior AI Engineer