
Data Engineer with 4+ years of experience in data migration, ETL development, and cloud-based data platforms. Strong expertise in Azure Data Factory (ADF), Azure Data Lake Storage (ADLS), Python, PySpark, and SQL for building scalable data pipelines and migration solutions. Proven ability to migrate large-scale data from legacy systems to modern cloud environments while ensuring data integrity, performance optimization, and minimal downtime.
Designed and developed end-to-end ETL pipelines using Azure Data Factory (ADF), including pipelines, data flows, datasets, linked services, and triggers for scalable data ingestion and transformation.
Built and configured reusable ADF components such as parameterized pipelines, variables, and dynamic datasets, improving pipeline flexibility and reusability.
Implemented data ingestion frameworks using Copy Activity to load data into staging layers (ADLS/SQL), followed by transformation using stored procedures and data flows to populate target systems.
Developed and optimized data migration workflows, ensuring data accuracy, consistency, and efficient movement from source to target systems.
Monitored and maintained ADF pipelines using built-in monitoring tools, implementing robust error handling, retry mechanisms, and alerting to ensure high pipeline reliability.
Configured Integration Runtime, Linked Services, and Data Sets to securely connect and process data across multiple sources and destinations.
Automated and scheduled ADF pipeline execution using TWS (Tivoli Workload Scheduler), enabling seamless orchestration of batch data processing workflows.
Implemented Slowly Changing Dimension (SCD Type 2) techniques in data warehousing solutions to maintain historical data tracking and auditability.
SQL programming
Python, PySpark
Data Migration,
Azure Data Factory (ADF)
Azure Data Lake Storage (ADLS Gen2)
Databricks
Big data processing,PySpark (Data Transformation, Optimization),Batch Processing (ADF Pipelines)