GAUTAM KUMAR

Kolkata

Summary

AI Engineer and Senior Data Scientist with over 12 years of experience in developing production-grade GenAI and full-stack data science solutions. Expertise in advanced LLM techniques, statistical modeling, and data visualization, driving impactful business decisions. Proven ability to convert research into scalable products while optimizing performance and cost. Recognized for high productivity and efficiency in delivering actionable insights through machine learning and predictive analytics.

Overview

12 years of professional experience
5 years of post-secondary education

Work History

Business Consultant / Senior Data Scientist

Tech Mahindra
Kolkata
04.2021 - Current
  • Constructed robust RAG pipelines for customer engagement analytics involving hybrid retrieval and vector DB integration.
  • Utilized machine learning alongside statistical methods to build effective predictive models for various applications.
  • Delivered classical data science initiatives such as churn prediction and claim inventory forecasting aligned with KPIs.
  • Developed agentic AI systems leveraging LangChain plus search APIs to automate workflows with confidence scoring mechanisms.
  • Directed MLOps processes by implementing containerized model strategies, CI/CD pipelines, and drift detection protocols.
  • Mentored emerging data scientists while collaborating with product teams to align analytics with business strategies.
  • Transformed analytical insights into quantifiable business outcomes through stakeholder engagement.

IT Analyst / Data Scientist

Tata Consultancy Services
03.2016 - 04.2021
  • Delivered talent‑gap analytics, repeat‑call prediction (7‑day and 21‑day windows), and multi‑label chat classification using Python, SQL, and Power BI; operationalized models into stakeholder dashboards for proactive interventions.
  • Implemented embedding‑based retrieval and classification models for knowledge retrieval; developed ETL and data pipelines for model inputs and reporting.
  • Conducted A/B tests and uplift analysis to evaluate interventions; worked with SMEs to turn insights into process changes.

Software Engineer / BI & Analytics Consultant

3i Infotech
01.2013 - 03.2016
  • Designed ETL pipelines, OBIEE/Tableau dashboards and reporting solutions to drive operational and financial decisions; improved data quality and reporting cadence.
  • Collaborated with stakeholders to gather requirements and translate them into analytics specifications and solutions.

Education

B.Tech - Electronics And Tele-Communications Engineering

BPUT
Cuttack
06.2008 - 08.2012

PGP - Business Analytics

Great Lakes Institute of Management
Gurugram
01.2017 - 01.2018

Skills

  • LLM and RAG
  • MCP
  • LoRA, QLoRA, and PEFT
  • Hybrid retrieval pipelines
  • Prompt engineering
  • Agentic AI frameworks
  • Machine learning and statistics
  • Time series forecasting
  • Ensemble methods
  • Hypothesis testing
  • Model validation
  • Social media analytics
  • Time-series ARIMA and Prophet
  • Data visualization tools
  • MLOps infrastructure
  • Monitoring and drift detection
  • Python and R programming
  • SQL database management
  • TensorFlow and PyTorch
  • Hugging Face libraries
  • FastAPI and Flask frameworks
  • Vector storage solutions
  • Project planning and management
  • Team building strategies
  • MLOps implementation techniques
  • Stakeholder engagement strategies

Extracurricular Activities And Open Source

  • Top 5% in multiple hackathons: WNS Analytics Wizard (2019), Club Mahindra Data Olympics, and LTFS Data Science FinHack
  • GitHub contributor: repositories demonstrating RAG pipelines, LoRA fine-tuning examples, and production MLOps patterns

Personal Information

  • Date of Birth: 12/18/90
  • Nationality: Indian

Languages

  • English
  • Hindi

Projects

Cisco: GenAI-powered organizational hierarchy validation (agentic AI). Built an agentic pipeline using LangChain, LangGraph, and search APIs to autonomously validate org structures. Components include web scraping, evidence retrieval, confidence scoring, and human-in-the-loop review; the pipeline uses AWS Bedrock.

Sharecare: Customer engagement analytics. Developed a scalable end-to-end call analytics platform integrating Whisper transcription, speaker diarization, sentiment and empathy detection, and LLM-based summarization. Applied LoRA fine-tuning on Llama 3.1 models, implemented monitoring and feedback loops, and deployed using Docker on AWS with CI/CD pipelines.

Advanced RAG Summarization & QA System: Engineered hybrid retrieval (semantic + sparse), vector DB integration, a chunking strategy, and LoRA-based fine-tuning of reader models to improve domain QA and reduce hallucinations.

STC B2B: Churn and issue prediction. Led churn and issue prediction modeling for B2B accounts using SAS Viya, Python, and Teradata; delivered dashboards and operational workflows enabling proactive retention strategies.

SeeR Analytics Platform: Delivered multiple production use cases (Order-to-Activate, Invalid Truck Roll, Ticket Aging) on an AWS-hosted analytics platform; implemented orchestration scripts, Lambda functions, and dashboards for operational monitoring.

Highlighted Achievements

  • Architected hybrid RAG systems integrating FAISS/Pinecone with sparse retrievers and chunking strategies; applied LoRA/PEFT fine-tuning to reader models, improving answer relevance and reducing hallucinations.
  • Built agentic AI workflows for automated org-hierarchy validation and QA automation with human-in-the-loop review, confidence scoring and audit trails.
  • Led speech analytics: Whisper transcription, speaker diarization, sentiment & empathy detection, and LLM summarization for post-facto and real-time insights.
  • Established MLOps practices: MLflow model registry, CI/CD, containerized deployment (Docker/K8s), monitoring, drift detection, latency & cost optimisation.
  • Delivered end-to-end analytics solutions (ETL → modelling → dashboarding) that improved operational KPIs and enabled proactive interventions (churn reduction, repeat-call mitigation).

Affiliations

  • Cricket and Coffee

Accomplishments

  • Tech Mahindra ACE Award - 2024

References

References available upon request.

Timeline

Business Consultant / Senior Data Scientist

Tech Mahindra
04.2021 - Current

PGP - Business Analytics

Great Lakes Institute of Management
01.2017 - 01.2018

IT Analyst / Data Scientist

Tata Consultancy Services
03.2016 - 04.2021

Software Engineer / BI & Analytics Consultant

3i Infotech
01.2013 - 03.2016

B.Tech - Electronics And Tele-Communications Engineering

BPUT
06.2008 - 08.2012