Azure Data Engineer with 2.5 years of experience in building and optimizing data pipelines using Azure Data Factory, Databricks (PySpark), and Synapse Analytics. Skilled in ETL, data modeling, performance tuning, and real-time analytics. Expertise in Delta Lake, SQL, cloud data security (RBAC, Key Vault). Passionate about scalable, cost-efficient data solutions and analytics-driven businesses.
Enterprise-Scale Batch Data Processing
Designed and implemented a scalable batch data processing pipeline on Microsoft Azure for a fintech/logistics enterprise, ensuring efficient data ingestion, transformation, and reporting from multiple sources like SAP HANA, PLMS, Freight Tiger, Mobility, HR, and SDMS. Built a cost-efficient, high-performance architecture using Azure Data Factory, Azure Synapse Analytics, and Power BI to enable real-time analytics and business intelligence., 50% faster ETL processing with optimized pipelines., 40% improved query performance using indexing & partitioning., 99.9% availability with proactive monitoring & auto-scaling.
Azure Data Lake Storage Gen2,Delta Lake, Azure Data Factory (ADF), Azure Databricks (PySpark),Azure Synapse Analytics , Fact & Dimension Tables, ADF Triggers, Logic Apps, Event Grid, Azure Active Directory (AAD), Key Vault, Azure Monitor, Log Analytics, Application Insights,