Summary
Overview
Work History
Education
Skills
Accomplishments
Timeline
Hi, I’m

Chandan Kumar

Bangalore
Chandan Kumar

Summary

Accomplished Principal Data Scientist with 15 years of experience spearheading innovation in AI, Data Science, Generative AI, Agentic AI, and advanced RAG systems. Proven leader in mentoring cross-functional teams, fostering collaborative environments to design and deploy cutting edge solutions, including multi-modal RAG pipelines, conversational AI and fine-tuned models. Excelled in strategic leadership by aligning AI initiatives with executive objectives, driving multimillion dollar projects to completion through agile methodologies, stakeholder management, and a track record of delivering efficiency gains. Passionate about cultivating talent, scaling enterprise AI infrastructures, and translating complex insights into transformative business value across industries.

Overview

15
years of professional experience

Work History

GSK
Bangalore

Principal Data Scientist
10.2020 - Current

Job overview

  • Project: Field Force Navigator (A 20% raise on an annual sale expected).
  • Description: It provides an ability to sales reps that brings together targeting and interaction planning (i.e., Next Best Action, opportunity engine), with Gen-AI enabled feedback, Insights, and coaching, it also Enhances sales force effectiveness by digitally equipping each sales rep's to achieve a new level of performance and increasing good sell out outcomes.
  • Key Responsibilities & Accomplishments:
  • Used GenAI for Content generation using OpenAI GPT-5.
  • Used RAG model to develop a ChatBot in FFN project for the sales rep's to ask relevant questions.
  • Used Chunking, Embedding, Vector Db, LLM and other components to build a RAG model.
  • Built conversational AI to process recorded audio between sales reps and HCPs, generating key insights on interactions, sentiments, and opportunities to inform campaign strategies.
  • Building and managing chains of prompts and responses to create complex interactions with model.
  • LangChain and LangGraph enables the orchestration of multiple prompts and actions in a sequence.
  • Integrated with other Cognitive services such as Azure Text Analytics, Speech Services, Translation Services and PII Removal etc. to enhance the capabilities of applications using GPT-5.
  • Utilising Azure data storage solutions like Azure Blob Storage, to manage data used for training and inference.

Project: Marketing Navigator (A $10 million annual impact).

The initiative builds an intelligent system that automates campaign creation through GenAI-driven content generation and advanced retrieval mechanisms. It processes HCP data to produce personalized, data-informed marketing strategies quickly, integrating historical trends with real-time insights for precise targeting. Core capabilities include Agentic AI workflows for complex problem-solving and multi-modal RAG for robust data handling.

Roles and Responsibilities

  • Led GenAI content generation using OpenAI GPT-5 to create tailored campaign materials from HCP insights.
  • Designed and implemented Agentic AI and multi-modal RAG systems, enabling LLM-powered autonomous agents to plan, invoke tools, and collaborate on business challenges.
  • Developed end-to-end Agentic RAG pipeline incorporating chunking, embedding, vector databases, and LLMs for accurate data retrieval and generation.
  • Model fine-tuning with domain-specific pharma datasets, including dataset preparation, training execution, and performance validation.

ALTISOURCE BUSINESS SOLUTION
Bangalore

Senior Data Scientist
08.2017 - 10.2020

Job overview

  • Project: US Housing Cost Prediction for Hubzu.com
  • Description: EDA and Data Cleaning, Training machine learning algorithms of Linear Regression, Support Vector Regression, Gradient Boosting Regression, Stacking of various models.
  • Technologies Used: AWS, Python, EDA, and Gradient Boosting Regressor.
  • Performed Data Preprocessing and EDA, Analyzing data and Training machine learning algorithms of Gradient Boosting Regressor and also used Hyper Parameter Optimization for best parameter threshold.
  • Support Vector Regression, Gradient Boosting Regression, Tried Stacking of various models for best fit.

PERSISTENT SYSTEMS. LTD
Bangalore

Senior Data Scientist
09.2015 - 08.2017

Job overview

  • Project: Opinion, Sentimental analysing from surveys using NLP.
  • Description: Sentiment analysis is the computational Study of people's opinions, sentiments, emotions, appraisals, and attitudes towards entities such as products, services, organizations.
  • Technologies Used: Python, RNN, LSTM, NLP, NLTK, Tensorflow.
  • Used various NLP techniques like: Sentence Segmentation, Word Tokenization, Stop words removal, Punctuations, Lemmatization and word to vector conversion etc.
  • Project: Intuit Turbo VISA Debit Card
  • Description: Improving Effectiveness of Predicting Credit Card Default Payment, as it is very important for any card company to be able to recognize fraudulent of Debit card transactions.
  • Technologies Used: Python, EDA, PCA, SVM, Matplotlib, Seaborn.
  • Performed EDA and Data Cleaning, Training machine learning algorithms of SupportVectorRegressor.

ASCENDUM SOLUTION PVT. LTD
Bangalore

Senior Software Engineer
09.2014 - 09.2015

Job overview

  • Project: Marketing- Live Site, Cross Browser.
  • Responsibilities:
  • Worked as, Python Developer.
  • Worked on designing and developing the backend system using python from scratch.

INDIUM SOFTWARE PVT. LTD
Bangalore

Senior Software Engineer
11.2013 - 08.2014

Job overview

  • Project: Card Wallet.
  • Responsibilities:
  • Worked as, JAVA Developer.
  • Worked on designing and developing the backend system using JAVA and Rest API.

LOGICA INDIA PVT. LTD
Bangalore

IT Consultant
03.2011 - 04.2013

Job overview

  • Project: British Telecom(BT).
  • Responsibilities:
  • Worked as, JAVA Developer.
  • Worked on designing and developing the backend system using JAVA and Rest API.

Education

Bharath University
Chennai, 73

Bachelor of Technology from Computer Science and Engineering
2009

Skills

  • Python
  • Spark
  • LangChain, LangGraph
  • LLM, Agentic AI, and RAG
  • Supervised Learning
  • Unsupervised learning
  • Machine learning, neural network
  • Natural language processing
  • Azure
  • AWS
  • Domain: Pharma, Finance, Banking, Retail, and Telecom
  • Team leadership
  • Mentoring
  • Project Management
  • Cross-functional collaboration
  • Stakeholder Management

Accomplishments

  • GSK IPTC(Ahead Together) Award, 2024
  • Gold award, 2025

Timeline

Principal Data Scientist

GSK
10.2020 - Current

Senior Data Scientist

ALTISOURCE BUSINESS SOLUTION
08.2017 - 10.2020

Senior Data Scientist

PERSISTENT SYSTEMS. LTD
09.2015 - 08.2017

Senior Software Engineer

ASCENDUM SOLUTION PVT. LTD
09.2014 - 09.2015

Senior Software Engineer

INDIUM SOFTWARE PVT. LTD
11.2013 - 08.2014

IT Consultant

LOGICA INDIA PVT. LTD
03.2011 - 04.2013

Bharath University

Bachelor of Technology from Computer Science and Engineering
Chandan Kumar