Data Engineer with 7.4 years of experience designing, implementing, and refining robust ETL solutions in the Azure ecosystem. Proficient in Azure Data Lake, Azure Databricks, Azure Data Factory, Azure Synapse Analytics, Azure Blob Storage, and Azure SQL Database for building efficient data pipelines. Reduced query execution time by 40% by optimizing Hive and Spark jobs. Strengthened data security frameworks, resulting in a 20% decrease in security incidents. Skilled in query tuning and optimization, particularly in Databricks and Spark environments. Passionate about using technology to drive data-driven insights and streamline operations.
• Developed ETL pipelines for data products such as Feature Bank using PySpark. Built a data lake platform on Azure using services such as ADLS Gen2, Azure Databricks, and Synapse Analytics.
• Performed data management, data access, data governance, integration, security, and operations on the Cloudera platform.
• Supported tenant applications end to end, including environment provisioning, access provisioning, job optimization, pipeline building, and fixing application- or platform-related issues.
• Optimized Hive, Spark, and Impala workloads for tenant applications.
Azure Databricks
Azure Data Engineer Associate