Menards
- Designed and developed daily and monthly pacing and spend reporting solutions on the Microsoft Azure platform.
- Built scalable ETL pipelines using Azure Data Factory to orchestrate Spark jobs and trigger workflows based on new data availability in ADLS Gen2.
- Leveraged Azure Databricks (serverless) to develop and execute PySpark/Spark SQL jobs for large-scale data processing.
- Implemented Delta Lake for intermediate data storage, enabling efficient data flow between jobs and supporting debugging and data validation.
- Integrated vendor-specific data from Azure MySQL DB with input datasets containing impressions, clicks, and spend metrics to generate enriched datasets.
- Developed interactive Power BI dashboards on top of curated data to provide actionable insights into campaign performance and spending trends.
Digital Twin
- Designed and developed a predictive analytics solution to forecast the probability distribution of wind power generation for wind turbines.
- Built end-to-end data pipelines using Azure Data Factory and Azure Databricks to process both historical and real-time data.
- Implemented machine learning models in Azure Databricks using PySpark to predict power generation based on sensor data, applying multiple algorithms to improve model accuracy.
- Performed data preparation, feature engineering, and model training for scalable and reliable predictions.
- Integrated historical data from MSSQL and real-time streaming data via Kafka (RESCA server), storing and processing data in Azure Cosmos DB.
Wendy’s – Customer Review Analytics & NLP
- Developed an NLP-based analytics solution to process and analyze ~20 million unstructured customer reviews across 6,650+ restaurant locations.
- Built scalable data processing pipelines using Azure Databricks and PySpark to perform large-scale text analysis including frequency analysis, n-grams (bi/tri-grams), sentiment analysis, and word associations/clustering.
- Performed multi-dimensional slice analysis based on Month, SKU, Complaint Code, Region, and Province to identify key business trends and customer issues.
- Implemented aspect-level and phrase-level sentiment analysis to extract granular insights from reviews and classify customer feedback as positive or negative.
- Developed machine learning models to predict user ratings by correlating sentiment scores, extracted phrases, and historical 5-star ratings.
- Enabled data-driven decision-making by providing actionable insights into customer satisfaction, product performance, and regional trends.