
Results-driven Data Engineer with 4 years of experience designing and optimizing scalable data pipelines using Azure, Databricks, and PySpark. Improved data processing efficiency by 35% and reduced pipeline errors by 60% through automation and Delta Lake optimizations. Skilled in building HIPAA-compliant ETL workflows for healthcare analytics, with a passion for translating raw data into actionable business insights. Certified in Azure (DP-203) and committed to continuous learning in distributed systems and data governance.
Cloud & Services: Azure Data Factory (ADF), Azure Databricks, Azure Synapse Analytics, Azure Data Lake Storage Gen2 (ADLS Gen2), Azure Blob Storage
Programming & Tools: PySpark, Python, SQL, MS SQL Server, Azure Storage Explorer, Git
Data Engineering: ETL/ELT Development, Delta Lake, Data Modeling, Data Governance, Performance Optimization, Debugging
Security & Compliance: HIPAA, PHI Masking, Access Control, Encryption
Other: API Integration, Power BI (Basic Reporting)