Diabetes UK, Datahub Developer
Azure Synapse Analytics, Azure Data Lake Storage Gen2, PySpark, Azure DevOps, Git
- Developed pipelines for a common ingestion framework in Azure Synapse.
- Extracted data from sources such as SFTP, NFS, and APIs and stored it in ADLS Gen2.
- Applied SQL performance optimization techniques.
- Developed PySpark notebooks in Synapse based on business rules.
- Wrote parameterized UDFs for data validation in Synapse.
- Migrated data between layers such as Bronze, Silver, and Gold.
- Deployed pipelines from lower environments to higher environments.
- Created databases, tables, and views in Azure Synapse based on requirements.
- Coordinated with business stakeholders on project requirements.

Etihad Airways, Senior Consultant
Azure Data Factory, Azure Databricks, Azure Synapse, Spark, PySpark
- Migrated data from Cloudera to Azure Data Lake Storage Gen2.
- Extracted data from web APIs to ADLS Gen2 using Azure Data Factory.
- Developed Azure Databricks notebooks based on business logic.
- Used Logic Apps and the Web activity to send email alerts for pipelines developed in ADF.
- Took part in production deployment activities.
- Migrated complex transformations from Cloudera to the Azure environment.
- Completed all POCs within the timelines.
- Created external tables in Azure Synapse.
- Experienced in using version control systems.
- Configured Azure Key Vault to store secrets and used them in Data Factory and Databricks.

Schneider Electric, IT Consultant
Azure, Hadoop, HDFS, Hive, Spark, Python, Databricks, Data Factory
- Migrated data received through the API from the ALBATROS application to Azure Data Lake.
- Maintained the data in various zones to perform the transformations.
- Designed various ingestion and processing patterns based on use cases.
- Wrote PySpark code to create dataframes and operate on the source data.
- Worked on ETL error handling logic in the Azure production environment.
- Converted on-premises stored procedures into PySpark dataframe operations.
- Built complex data ingestion and processing frameworks using Azure Databricks, Python, and PySpark.
- Documented the flow of processes and data from different applications.
- Took part in project-related and architectural calls.
- Currently working on POCs for project implementations.

Advanced Analytics Euro Clear, Big Data Developer
Azure, Hadoop, HDFS, Hive, Spark, Python, Databricks, Data Factory
- Worked closely with business analysts to convert business requirements into technical requirements and prepared low-level and high-level documentation.
- Imported required tables from RDBMS to HDFS using Sqoop.
- Developed a data pipeline using Flume, Sqoop, MapReduce, and Spark to ingest customer behavioral data and purchase histories into HDFS for analysis.
- Developed and ran MapReduce jobs on multi-petabyte YARN and Hadoop clusters that process billions of events every day, generating daily and monthly reports according to user needs.
- Developed Apache Spark applications in Scala and Python and implemented an Apache Spark data processing project to handle data from various RDBMS and streaming sources.