Results-driven Data Engineer with expertise in designing and implementing scalable, high-performance data pipelines and real-time analytics solutions. Proficient in cloud platforms including Azure, Databricks, and AWS, with hands-on experience in big data technologies such as Apache Spark, Kafka, and SQL-based systems. Skilled in developing metadata-driven workflows, optimizing ETL processes, and integrating machine learning models for fraud detection and predictive analytics. Strong focus on data security, governance, and cost-efficient architecture to support compliance and operational reliability. Proven ability to collaborate with cross-functional teams, solve complex data challenges, and deliver solutions that improve business insight in high-scale environments.
Clickstream Analysis and Fraud Detection Pipeline
Airline Data Pipeline Automation and Incremental Processing with CI/CD Integration
- Designed and implemented an incremental data processing pipeline for airline data using Azure Data Factory (ADF), integrating Azure Data Lake Storage (ADLS) with Azure Synapse Analytics for efficient analytics.
- Established a robust CI/CD process in Azure DevOps, including the setup of repositories, agent pools, and pipelines, enabling automated deployment and minimizing downtime in production.
- Streamlined production releases by automating the deployment of ARM templates for ADF pipelines, ensuring consistency and reducing manual intervention during updates.
- Optimized data workflows and system performance by using Logic Apps to orchestrate and integrate Azure services.
- Collaborated with cross-functional teams to ensure scalability, maintainability, and adherence to best practices in cloud-based data engineering.
Tech Stack: Azure Data Lake Storage (ADLS), Azure Data Factory (ADF), Azure Synapse Analytics, Logic Apps, GitHub, Azure DevOps