Mukul Mundle

Principal Data Scientist
Pune

Summary

A results-driven Principal Data Scientist with 6+ years of experience architecting and deploying high-impact AI/ML, Generative AI, and Digital Transformation solutions across the Automotive, Pharmaceutical, and BFSI sectors. My expertise lies in leading data science teams, managing the end-to-end project lifecycle, and applying advanced techniques like sophisticated RAG architectures, LLM fine-tuning, and multi-agent AI systems to solve complex business problems. I have a proven track record of delivering significant value, from enhancing customer support with multimodal AI agents at Toyota to optimizing multi-million-dollar inventories at Sun Pharma. I am passionate about building scalable, enterprise-grade AI systems and ensuring their ethical deployment by adhering to Responsible AI standards.

Overview

7 years of professional experience
3 Certifications

Work History

Principal Data Scientist

Ascentt (Client: Toyota Motors North America)
07.2024 - Current
  • Led a team of 3 Data Scientists, managed project timelines (JIRA), and provided technical mentorship.
  • Architected an advanced, multimodal RAG pipeline integrating text (manuals), images (diagrams), and structured data (vehicle specs) from disparate sources.
  • Implemented a hybrid retrieval strategy using BM25 for keyword search and OpenAI/CLIP embeddings for semantic and image-based search, followed by a cross-encoder re-ranking step to maximize context relevance.
  • Deployed the system on Azure, reducing query resolution time by 40%, and evaluated factual consistency rigorously using the RAGAS framework.
  • Designed and deployed multi-agent systems using AutoGen and CrewAI to automate complex workflows like competitor analysis and customer issue resolution.
  • Developed specialized agents with distinct roles (e.g., Planner, Information Retriever, Data Analyst) that collaborate to perform tasks, reducing manual effort in market research by 70%.
  • Utilized LangGraph to build a stateful, agentic framework with integrated tools (vector search, calculator, search API), improving first-contact resolution rates by 40%.
  • Developed an XGBoost model to predict auto loan defaults, achieving 89% AUC-ROC and reducing portfolio default risk by over 12%.
  • Engineered a real-time early warning system to identify delinquency risk, enabling targeted outreach that reduced 30+ DPD accounts by 18%.
  • Analyzed inventory distribution and forklift movement patterns using PostGIS and PostgreSQL to identify optimal storage locations for high-turnover parts.
  • Developed an optimized routing algorithm for forklifts, reducing travel distance and streamlining pick-and-pack processes.
  • Led development of sales and inventory forecasting models (TimeGPT, N-BEATS, Prophet) deployed on AWS SageMaker with GitHub Actions.
  • Implemented inventory optimization using Gurobi and orchestrated scalable data pipelines on Databricks with Spark, using Airflow for scheduling and WandB for model versioning.
  • Developed and deployed a YOLO-based computer vision solution to identify and classify defects in Toyota cars, achieving a 95% detection rate and reducing manual inspection time by 60%.

Senior Manager MD's Office Data Scientist and Analytics

Sun Pharmaceuticals
10.2023 - 06.2024
  • Built a RAG-powered knowledge engine on Pinecone for semantic search across thousands of regulatory SOPs and clinical trial protocols.
  • Implemented optimized document chunking strategies tailored for complex scientific and legal text, improving retrieval accuracy for highly specific queries.
  • Evaluated and improved the system for factual accuracy and completeness using metrics like BLEU and ROUGE, enabling faster access to critical compliance information.
  • Fine-tuned LLaMA and GPT models using QLoRA on a vast corpus of internal pharmaceutical documents to automate SOP summarization, compliance checks, and report generation.
  • This specialized model understood the nuances of pharmaceutical terminology, reducing manual review and documentation effort by 40%.
  • Developed a GenAI pipeline using BioNER and LangChain to extract structured data (eligibility, dosage) from unstructured PDFs.
  • Utilized BERT & LSTM for text classification to determine if a patient meets clinical trial eligibility criteria, streamlining the patient matching process.
  • Reduced stockouts and excess inventory by $30M through ML-driven inventory optimization (ARIMA, Prophet, XGBoost), cutting provisions by 10%.
  • Implemented a dynamic pricing engine using price elasticity modeling and linear programming (SciPy), delivering a 5-10% margin uplift.

Deputy Manager Business Excellence and Digital Transformation

Jubilant Life Sciences
08.2021 - 08.2022
  • Led the digital transformation strategy, creating a manufacturing ecosystem driven by AI and Industry 4.0 for India and North America.
  • Deployed predictive maintenance models (SVM, ANN, Random Forest) that reduced monthly maintenance expenditure by 50% and improved MTBF by 20%.
  • Enhanced product robustness by implementing DMAIC methodology and ML models (Random Forest, GBM), reducing batch failures by 30%.

Business Translator Digital Transformation - Ops Next

Dr. Reddy’s Laboratories
07.2018 - 07.2021
  • Designed and led the ML pipeline for Yield Improvement, increasing the yield of the top 20 SKUs to 98% and delivering annual savings of ₹5.6 Cr.
  • Utilized explainable AI techniques (SHAP, LIME) and anomaly detection (K-means, DBSCAN) to monitor critical process parameters, reducing the cost of poor quality with a potential impact of ₹3.3 Cr.
  • Architected a multi-constrained schedule optimizer using Gurobi and a Genetic Algorithm for a product mix of 200+ SKUs, reducing logistics costs by 22%.
  • This work contributed to the facility being recognized by the World Economic Forum as part of the Global Lighthouse Network.

Education

Post Graduation (MBA) - Business Analytics

Indian Institute of Management (IIM)
01.2024

BT-MT Dual Degree - Chemical Engineering

Indian Institute of Technology
01.2018

Grade 12th - HSC Board Maharashtra

Shivaji Science College
01.2013

Grade 10th - SSC Board Maharashtra

Wainganga Vidyalaya
01.2011

Skills

Certification

AWS Certified Machine Learning – Specialty
