
Innovative Senior Data Scientist specializing in generative AI and machine learning. Engineered a voice-enabled AI platform, significantly enhancing training efficiency. Expertise in prompt engineering and scalable architectures, focused on solving complex problems and optimizing AI systems. Proficient in machine learning, predictive analytics, and big data processing, effectively translating data into actionable insights to drive project success.
• Architected Simulate.AI, a voice-enabled AI simulation platform (FastAPI + OpenAI Realtime API) serving 1,000+ pharma reps — real-time WebSocket audio streaming, 15+ AI doctor personas, and sub-second latency, reducing training setup time by 40%
• Engineered a custom voice pipeline (PCM16 → VAD silence detection → Whisper STT → GPT-4o-mini → TTS) with async WebSocket orchestration and streaming audio playback, achieving 1.5–3.5s end-to-end latency
• Designed a 6-agent Automated Model Migration Framework to evaluate and refine 500+ LLM prompts during model deprecations — LLM-as-Judge scoring, pharma compliance checks, and auto-rollback, reducing migration cycles from weeks to days
• Integrated Simulate.AI into Microsoft Teams via Azure Communication Services, bridging real-time media streams to the existing OpenAI Realtime API backend for in-call AI roleplay
• Built Coach.AI Hub, an agentic hybrid RAG system (LangGraph + FAISS + BM25 + cross-encoder reranking) with a Planner Agent, query decomposition, and LLM-as-Judge validation — replacing naive RAG and eliminating hallucination issues
• Built data enrichment engine using multi-source scraping (LinkedIn/Wikipedia/Bing) with GPT-based entity normalization, improving lead accuracy by 35% across multilingual pharma datasets
• Implemented async data pipelines on AWS EKS with optimized connection pooling and structured logging, achieving 50% throughput improvement for large-scale conversational data processing
• Developed PharmRebalance (ZS Hackathon 2026) — a multi-agent inventory rebalancing system with LP optimization, FEFO compliance, safety stock calibration, and digital twin simulation for service-level and patient outcome impact
• Engineered prompt framework achieving 92% consistency in LLM-based coaching quality assessments across 5 dimensions using systematic optimization and few-shot learning
• Systematically debugged and repaired ML codebases across 30+ OpenAI MLE-Bench Kaggle competitions spanning computer vision (image classification, object detection, medical imaging), NLP (text normalization, essay scoring, code understanding), and tabular ML — following strict minimal-change protocols to isolate and fix data pipeline errors, model architecture mismatches, and training loop failures
• Resolved bugs across diverse frameworks (PyTorch, TensorFlow, scikit-learn, XGBoost) including NumPy/CUDA compatibility issues, audio preprocessing shape mismatches, DICOM medical image loading failures, and 3D point cloud parsing errors — validating each fix end-to-end via the MLE-Bench grading pipeline on GCP VMs