Big Data Engineer
- Optimized data pipelines using Delta cache, Adaptive Query Execution, partitioning, and bucketing.
- Collaborated with stakeholders to gather and clarify requirements, and worked in an Agile methodology to develop a data engineering model.
- Designed an end-to-end pipeline using Azure Data Factory and Azure Databricks for a lending project.
- Created data quality checks and automated notifications, triggered when malicious data is detected, using Azure Data Factory and Azure Databricks.
- Strengthened POCs for environment setup using Azure Code Deploy on Databricks and ADLS Gen2.
- Conducted performance tuning on PySpark code, reducing execution time by 20%.
- Achieved proficiency in PySpark, Azure Data Factory, Databricks, ADLS Gen2, Delta Lake, and other Azure services.