Mentored 50+ graduate students by meticulously reviewing and debugging their analytics pipelines, covering data cleaning, feature engineering, and clinical entity extraction from healthcare records.
Served as the go-to technical resource for student questions while conducting rigorous code and paper reviews to uphold data quality, privacy, and statistical standards across projects.
Research Data Analyst Intern
Rutgers, The State University of New Jersey
New Brunswick, NJ
05.2024 - 08.2024
Co-first author of a quantitative study (n=319) applying hierarchical multivariable regression in IBM SPSS to identify behavioral and psychosocial drivers of skin cancer screening intent.
Improved model explanatory power from R² = 0.23 to R² = 0.43, demonstrating that perceived barriers, social norms, and risk perception outperform demographic variables as predictors, directly challenging conventional assumptions in prevention research.
Translated findings into strategic recommendations for intervention design, advocating social influence-based outreach over traditional awareness campaigns.
Data Analyst Intern
Tata Consultancy Services Inc.
Remote
06.2022 - 09.2022
Designed a text analytics engine using NLP (spaCy, NLTK, scikit-learn) to classify sentiment in airline customer feedback across 3 classes (positive, neutral, negative), achieving 77% accuracy in proof-of-concept analyses.
Built a full ML pipeline covering tokenization, stopword removal, POS tagging, lemmatization, and TF-IDF feature engineering; benchmarked SVC, Naive Bayes, and Random Forest classifiers to identify the best-performing model.
Data Analyst and Research Volunteer
Rutgers, The State University of New Jersey
Remote
08.2025 - 03.2026
Conducting structured interviews with healthcare providers to map socio-technical barriers to AI and AR adoption in clinical settings, translating qualitative data into actionable design recommendations.
Synthesizing stakeholder feedback to benchmark clinical tool usability and align AI integration strategy with provider workflow requirements.
Education
Master of Science - Health Informatics
Rutgers, The State University of New Jersey
05.2025
Bachelor of Engineering - Biomedical Engineering
University of Mumbai - Thadomal Shahani Engineering College
Clinical Decision Support: RAG-Powered Guideline QA System
Python, LangChain, Chroma, Groq API
Engineered a RAG pipeline over the VA/DoD Opioid Therapy CPG to enable fast, source-cited answers to clinical questions, directly addressing the challenge of querying a 177-page clinical guideline quickly in practice.
Designed the pipeline end-to-end, covering semantic chunking, vector retrieval via Chroma, and prompt engineering to constrain LLM responses strictly to retrieved context with page-level citations; benchmarked multiple models via API.
Evaluated against a clinical ground truth dataset using BERTScore, achieving F1=0.82, confirming the system produces semantically accurate, auditable answers reliable enough for clinical reference.
Real-World Evidence (RWE) Study of Treatment Patterns and Safety Outcomes in Atrial Fibrillation Patients
SAS, SQL
Engineered a SAS production pipeline converting 1,000+ fragmented EHR encounters into a validated cohort of 282 AFib patients across 3 treatment arms (NOAC, Warfarin, Aspirin), automating clinical risk scoring with 100% PROC COMPARE validation.
Confirmed statistically significant differences in stroke and bleeding safety outcomes between treatment cohorts (p=0.0175) via chi-square analysis and Kaplan-Meier survival modeling.
Delivered 7 regulatory-compliant clinical reports (PDF/RTF) covering demographics, treatment patterns, healthcare utilization, and safety outcomes, structured to mirror real-world evidence reporting standards.
Medicaid Drug Expenditure Analysis: Cost Driver Identification
MySQL, CMS National Drug Spending Data
Quantified drug pricing inefficiencies by calculating Compound Annual Growth Rates (CAGR) and detecting cost increases in competitive markets that were invisible in single-year snapshots.
Uncovered 5-year cost outliers and drugs with accelerating price hikes, providing data-backed insights to enable proactive formulary management and Medicaid budget planning.
U.S. Health Insurance Coverage Trends: Impact of the Affordable Care Act
Tableau, Census Bureau and HHS Data
Visualized 15 years of national insurance data (2008-2023) to quantify ACA policy effects and Medicaid expansion outcomes across 50 states, identifying state-level coverage disparities and trends to inform healthcare access policy debates.
Created interactive story dashboards translating complex CMS datasets into policy-relevant narratives for stakeholders.
Timeline
Data Analyst and Research Volunteer
Rutgers, The State University of New Jersey
08.2025 - 03.2026
Graduate Assistant
Rutgers, The State University of New Jersey
09.2024 - 05.2025
Research Data Analyst Intern
Rutgers, The State University of New Jersey
05.2024 - 08.2024
Data Analyst Intern
Tata Consultancy Services Inc.
06.2022 - 09.2022
Master of Science - Health Informatics
Rutgers, The State University of New Jersey
Bachelor of Engineering - Biomedical Engineering
University of Mumbai - Thadomal Shahani Engineering College