A results-driven Principal Data Scientist with 6+ years of experience architecting and deploying high-impact AI/ML, Generative AI, and Digital Transformation solutions across the Automotive, Pharmaceutical, and BFSI sectors. My expertise lies in leading data science teams, managing the end-to-end project lifecycle, and applying advanced techniques like sophisticated RAG architectures, LLM fine-tuning, and multi-agent AI systems to solve complex business problems. I have a proven track record of delivering significant value, from enhancing customer support with multimodal AI agents at Toyota to optimizing multi-million-dollar inventories at Sun Pharma. I am passionate about building scalable, enterprise-grade AI systems and ensuring their ethical deployment by adhering to Responsible AI standards.
Overview
7
7
years of professional experience
3
3
Certifications
Work History
Principal Data Scientist
Ascentt (Client: Toyota Motors North America)
07.2024 - Current
Led a team of 3 Data Scientists, managed project timelines (JIRA), and provided technical mentorship.
Architected an advanced, multimodal RAG pipeline integrating text (manuals), images (diagrams), and structured data (vehicle specs) from disparate sources.
Implemented a hybrid retrieval strategy using BM25 for keyword search and OpenAI/CLIP embeddings for semantic and image-based search, followed by a cross-encoder re-ranking step to maximize context relevance.
Deployed the system on Azure, reducing query resolution time by 40% and ensuring factual consistency, evaluated rigorously using the RAGAS framework.
Designed and deployed multi-agent systems using AutoGen and CrewAI to automate complex workflows like competitor analysis and customer issue resolution.
Developed specialized agents with distinct roles (e.g., Planner, Information Retriever, Data Analyst) that collaborate to perform tasks, reducing manual effort in market research by 70%.
Utilized LangGraph to build a stateful, agentic framework with integrated tools (vector search, calculator, search API), improving first-contact resolution rates by 40%.
Developed an XGBoost model to predict auto loan defaults, achieving 89% AUC-ROC and reducing portfolio default risk by over 12%.
Engineered a real-time early warning system to identify delinquency risk, enabling targeted outreach that reduced 30+ DPD accounts by 18%.
Analyzed inventory distribution and forklift movement patterns using PostGIS and PostGreSQL to identify optimal storage locations for high-turnover parts.
Developed an optimized routing algorithm for forklifts, reducing travel distance and streamlining pick-and-pack processes.
Led development of sales & inventory forecasting models (TimeGPT, NBEATS, Prophet) deployed on AWS Sagemaker with GitHub Actions.
Implemented inventory optimization using Gurobi and orchestrated scalable data pipelines on Databricks with Spark, using Airflow for scheduling and WandB for model versioning.
Developed and deployed a YOLO-based computer vision solution to identify and classify defects in Toyota cars, achieving 95% detection rates and reducing manual inspection time by 60%.
Senior Manager MD's Office Data Scientist and Analytics
Sun Pharmaceuticals
10.2023 - 06.2024
Built a RAG-powered knowledge engine on Pinecone for semantic search across thousands of regulatory SOPs and clinical trial protocols.
Implemented optimized document chunking strategies tailored for complex scientific and legal text, improving retrieval accuracy for highly specific queries.
Evaluated and improved the system for factual accuracy and completeness using metrics like BLEU and ROUGE, enabling faster access to critical compliance information.
Fine-tuned LLaMA and GPT models using QLoRA on a vast corpus of internal pharmaceutical documents to automate SOP summarization, compliance checks, and report generation.
This specialized model understood the nuances of pharmaceutical terminology, reducing manual review and documentation effort by 40%.
Developed a GenAI pipeline using BioNER and LangChain to extract structured data (eligibility, dosage) from unstructured PDFs.
Utilized BERT & LSTM for text classification to determine if a patient meets clinical trial eligibility criteria, streamlining the patient matching process.
Reduced stockouts and excess inventory by $30M through ML-driven inventory optimization (ARIMA, Prophet, XGBoost), cutting provisions by 10%.
Implemented a dynamic pricing engine using price elasticity modeling and linear programming (SciPy), delivering a 5-10% margin uplift.
Deputy Manager Business Excellence and Digital Transformation
Jubilant Life Sciences
08.2021 - 08.2022
Led the digital transformation strategy, creating a manufacturing ecosystem driven by AI and Industry 4.0 for India and North America.
Deployed predictive maintenance models (SVM, ANN, Random Forest) that reduced monthly maintenance expenditure by 50% and improved MTBF by 20%.
Enhanced product robustness by implementing DMAIC methodology and ML models (Random Forest, GBM), reducing batch failures by 30%.
Business Translator Digital Transformation- Ops Next
Dr. Reddy’s Laboratories
07.2018 - 07.2021
Designed and led the ML pipeline for Yield Improvement, increasing the yield of the top 20 SKUs to 98% and delivering annual savings of ₹5.6 Cr.
Utilized explainable AI techniques (SHAP, LIME) and anomaly detection (K-means, DBSCAN) to monitor critical process parameters, reducing the cost of poor quality with a potential impact of ₹3.3Cr.
Architected a multi-constrained schedule optimizer using Gurobi and a Genetic Algorithm for a product mix of 200+ SKUs, reducing logistics costs by 22%.
This work contributed to the facility being recognized by the World Economic Forum as part of the Global Lighthouse Network.
Education
Post Graduation (MBA) - Business Analytics
Indian Institute of Management (IIM)
01.2024
BT-MT Dual Degree - Chemical Engineering
Indian Institute of Technology
01.2018
Grade 12th - HSC Board Maharashtra
Shivaji Science College
01.2013
Grade 10th - SSC Board Maharashtra
Wainganga Vidyalaya
01.2011
Skills
Certification
AWS Certified Machine Learning – Specialty
Timeline
Principal Data Scientist
Ascentt (Client: Toyota Motors North America)
07.2024 - Current
Senior Manager MD's Office Data Scientist and Analytics
Sun Pharmaceuticals
10.2023 - 06.2024
Deputy Manager Business Excellence and Digital Transformation
Jubilant Life Sciences
08.2021 - 08.2022
Business Translator Digital Transformation- Ops Next
Sr Technical Project Manager at Toyota Motors North America [Contract : Ascentt]Sr Technical Project Manager at Toyota Motors North America [Contract : Ascentt]
DIGITAL PRODUCTS & STRATEGY LEAD AMEA at SYNGENTA GLOBAL CAPABILITY CENTER PVT. LTD.DIGITAL PRODUCTS & STRATEGY LEAD AMEA at SYNGENTA GLOBAL CAPABILITY CENTER PVT. LTD.