Enthusiastic Data Scientist with a focus on AI and LLMs. Committed to continuous learning and advancing innovation in data-driven environments. Passionate about leveraging new tools to enhance decision-making and solve real-world problems.
Predictive Rounding Model.
Directed the development of a machine learning model to address class imbalance, enhancing prediction accuracy for the minority class in large datasets.
Applied techniques like SMOTE and Random Under Sampling, and adjusted class weight parameters to optimize F1-score and other key metrics.
Predictive Rounding Model (Fairness Analysis)
Led the integration of a fairness analysis pipeline within ML flow using Aequitas, focusing on ethical AI practices.
Analyzed and corrected disparities related to gender, race, and region, ensuring fair and transparent outcomes in machine learning models.
Technical Skills
Languages: Python
Tools/Platforms: DeBERTa, RoBERTa, XLNet, DistilBERT, Aequitas, IBM AI Fairness 360
Techniques: Machine Learning, Deep Learning, Sentiment Analysis, Bias Mitigation, Fairness Analysis
NDx (Sentiment Analysis Scoring)
Fine-tuned advanced NLP models (RoBerta-Large, XLNet-Large, DistilBert-Base) for sentiment analysis, achieving a 97% accuracy rate in both regression and classification tasks.
Conducted comparative runtime analysis on GPUs to ensure efficient model deployment.
Benchmarked model performance using newly annotated datasets to ensure robustness and accuracy.
Sentiment Analysis Scoring (Fairness and Bias)
Conducted bias analysis and mitigation for sentiment analysis models, focusing on gender, race, and region disparities.
Utilized tools like Aequitas and IBM AI Fairness 360 (AIF360) for fairness assessment.
Implemented visualizations to display bias distribution and collaborated on model improvements to ensure equitable outcomes.
MDU Consumerism
Project Focus: Extracting insights from consumer reviews through advanced NLP techniques.
Approach: Utilized DeBERTa, fine-tuned for accuracy in aligning multiple features and opinions within long sentences of complex reviews.
Achievements: Enhanced model precision for extracting multiple relevant features and opinions.
Improved quality of insights by ensuring accurate pairing of features and opinions.
Addressed challenges associated with the intricacies of long-form text.
Clinical NER Projects
Data Scientist, Healthcare NLP
[April 2019] - [June 2020]
Aiklu Triage Project
Lead Data Scientist
[June 2020] - [Dec 2022]
Data Science & Artificial Intelligence, INSOFE
Data Science & Artificial Intelligence, INSOFE