
Data Engineer with 7 years of industry experience, including 3 years in core data engineering, specializing in designing, building, and optimizing scalable data solutions on the Azure cloud platform. Experienced in developing batch and real-time data pipelines with PySpark, SQL, Azure Data Factory, and Azure Databricks to improve data availability and enable enterprise-scale analytics in the investment banking domain. Skilled in implementing Medallion Architecture and data governance practices to ensure secure, reliable, and high-performance data processing.
Cloud & Platforms:
Azure Databricks, ADLS Gen2
Programming & Query:
Python (PySpark), SQL, Spark SQL
Big Data & Streaming:
Apache Spark, Structured Streaming, Apache Kafka
Data Engineering:
ETL/ELT Pipelines, Medallion Architecture, Delta Lake, CDC, Data Modeling
Academy Accreditation: Databricks