Working with data has always been a passion of mine, and it’s one of the key reasons I enjoy my work. Writing SQL queries to extract and analyze data is a regular part of my daily routine. I thrive both as an individual contributor and as a collaborative team member. A quick learner, I continuously strive to grow by embracing new experiences and learning from my mistakes.
1. Project: Barnes & Noble (Sep 2022 - Mar 2023).
Description: Executed various data operations, including data profiling, modeling, source-to-target mapping, integration, transformation, and report generation using technologies such as Azure Data Factory, Azure Databricks, ADLS, SQL Server, and Tableau.
Responsibilities: Developed Databricks Notebooks for data storage, transformation, analysis, and cleaning.
Gained hands-on experience with Azure Data Factory, and participated in pipeline testing and data loading.
Applied knowledge of data modelling using the Oracle Data Modeler tool.
2. Project: Valvoline [Apr 2023 - Jun 2023].
Description: This was a demo project for the Valvoline Group, aimed at building a store inventory forecast system. As part of the Data Engineering (DE) team, I worked on processing inventory data from S3 buckets, transforming it, and providing data to the Data Science team.
Responsibilities: Led the creation of end-to-end data pipelines to process and transform inventory data from S3 buckets.
Designed and implemented Databricks medallion architecture (Bronze, Silver, Gold layers) to structure and refine inventory data for forecasting purposes.
Built batch processing pipelines using PySpark, optimizing the processing of large volumes of data.
3. Project: Rebate.ai for Network Distribution [Jul 2023 - Present].
Description: Rebate.ai is a B2B rebate management system that automates rebate processes and identifies revenue opportunities across the rebate ecosystem, including vendors, distributors, and suppliers.
Responsibilities: Designed and developed stored procedures on Snowflake for rebate calculation and processing, optimizing rebate workflows.
Supported three primary user groups: vendors, distributors, and buying groups.
Managed data uploads from vendors, including transaction files and product catalogs.