Results-driven ETL professional with almost 2 years of experience in Azure and database technologies. Proficient in building robust data pipelines with Azure Data Factory that move data reliably across diverse sources and destinations. Skilled in designing SQL database structures to meet changing business needs.
Experienced in designing scalable data pipelines using PySpark. Developed end-to-end data processing solutions on Azure, using Azure Data Factory for data cataloging, ETL orchestration, and schema inference, alongside Azure Data Lake for scalable storage of raw and processed data.
Project 1:
Azure Data Integration Project 6/2022-9/2022
Project: MovieLens Recommendation System 3/2023-7/2023
Environment: Azure Portal, ADF (Azure Data Factory), Databricks
Key Responsibilities:
⋄ Developed and deployed scalable data pipelines and workflows using Azure Databricks notebooks and jobs, orchestrating data ingestion, transformation, and loading processes.
⋄ Implemented data transformation and manipulation logic using PySpark RDDs (Resilient Distributed Datasets) and DataFrames, ensuring data quality and integrity.
⋄ Implemented batch and streaming data processing solutions using PySpark Streaming and Azure Databricks Structured Streaming, enabling real-time insights and decision-making.
SnowIOT:
Automotive IoT Data Analytics Project 9/2023-1/2024
Project Description:
Implemented a comprehensive IoT data analytics solution, SnowIOT, focused on real-time vehicle telemetry data analysis. Leveraged Microsoft Azure and Snowflake to store and analyze data, enabling insights into driver behavior and mileage.
Key Responsibilities:
Dynamic data engineer with almost 2 years of hands-on experience developing data pipelines and SQL transformations. Proficient in using Azure Data Factory to orchestrate data workflows and load data into Snowflake or SQL Server (via SQL Server Management Studio, SSMS) for further analysis and reporting.
Technologies Used:
Microsoft Azure
Azure Data Factory
Snowflake
SSMS
Python
MySQL