Stackoverflow tag predictor (2020): Developed multi-label classification model to predict Stack Overflow tags from question titles and content using NLP techniques and scikit-learn., Lakehouse architecture with Delta tables (2023): Developed a poc for a project to implement lakehouse architecture using delta tables in aws azure leveraging databricks.
“Rise and Shine Award” for contributing social media data connectors to Airbyte open-source project, enabling enterprise-wide data ingestion, “Coach Award” for designing dockerized local development framework, accelerating AWS Glue and EMR serverless job testing across teams