Results-oriented Data Scientist with expertise in healthcare analytics and a proven track record at UnitedHealth Group (UHG). Skilled in leveraging advanced analytical techniques and tools like PySpark, Hive, and SQL to extract actionable insights from complex healthcare datasets. Dedicated to optimizing patient outcomes, enhancing operational efficiency, and driving strategic decision-making.
• Developed and implemented an automated pipeline to process patient visit data from barcodes.
• Managed and optimized data processing at scale, handling up to 2M barcodes per week
• Identified high-risk patients by filtering barcodes with high disease probability scores, reducing false positives by 25% and increasing detection rate by 30%
• Generated projections for business performance and patient outcomes, improving strategic planning and resource allocation
• Compared performance of two NLP models through projection analysis, selecting superior model which improved accuracy by 15%
• Worked on a clustering project to optimize coder performance, assigned coders to clusters (C1, C2, C3, C4, C5) based on their past performance using clustering algorithms.
• Enhanced team efficiency by categorizing coders into performance-based clusters, leading to more effective workload distribution and improved coding accuracy
Big Data Technologies: HDFS, YARN, MapReduce, Spark, HIVE, Presto, HBASE
Resolvr: Data Analytics Case Study - python, Power BI, Excel,