5 + years of experience with core focus on building and maintaining ETL pipeline in Azure and AWS. New team members' integration and skill- building into the project. Developed strong skills in Spark, Cloud Technology, and ETL technologies like Databricks and AWS EMR while working on several projects.
Preclinical Pipeline - data42(Novartis)
Data Lake Development for multiple regions (APAC, LATAM, EMEA etc.) - Roche
Epidemiology Parquet Ingestion(EPI) - Gilead Sciences.
Enterprise Data Lake 2.0 (EDL) - Amgen
Python
PySpark
SQL
Databricks
Azure Data Lake Storage
Azure Synapse Analytics
Azure Data Factory
AWS(EMR, S3, Athena, Glue, Redshift)
Hadoop(Hive, Sqoop)
Airflow
Git