
Senior Data Engineer with 15+ years of total IT experience, including 4+ years in Microsoft Azure Big Data Engineering. Strong expertise in Azure Databricks, Azure Data Factory, ADLS Gen2, Azure SQL, and Azure Lakehouse architectures. Deep understanding of Apache Spark internals (RDD, DataFrame, Dataset) with hands-on experience in PySpark, Spark SQL, batch and streaming pipelines. Proven ability to architect and implement scalable, high-performance data solutions, mentor engineering teams, and collaborate with cross-functional stakeholders to deliver governed, production-ready data platforms.
Cloud & Azure: Azure Databricks, Azure Data Factory, ADLS Gen2, Azure SQL Database, Azure Cosmos DB, Azure Data Explorer
Big Data: Data Lakes, Lakehouse, Apache Spark
Databricks & Spark: Spark Architecture, RDD, DataFrame, Dataset, PySpark, Spark SQL, Databricks Notebooks, Workflows, Unity Catalog, SQL Warehouse, Serverless Compute
Programming: Python, PySpark, SQL, PL/SQL
SQL & Databases: Advanced SQL (complex queries, performance tuning, troubleshooting) Azure SQL, Oracle, SQL Server, PostgreSQL, Teradata
Data Engineering: Batch & Streaming Data Processing, Medallion Architecture (Bronze–Silver–Gold), Delta Lake, Parquet, Avro, JSON, CSV
Data Modelling: Dimensional Modeling (Star/Snowflake), 3NF, Data Warehousing Concepts
DevOps & CI/CD: Azure DevOps, GIT
Analytics: Tableau, Power BI