Lead Data Engineer/Senior Data Architect with over 10 years of experience in design and development of data pipelines and processing huge volumes of data using tools and frameworks in the Hadoop and Spark ecosystem. Also I’m proficient in a variety of platforms, languages and methods including Hadoop, Spark core, Spark sql, Hive scripts, No-SQL - HBase, HDFS, Airflow, Kafka,S3, EMR, EC2, Lambda, Cloud Watch, DynamoDB, Impala on both On-prem and Cloud (AWS).
Client : Nike, United States of America
Project Name :NGAP Migration
The main objective of this project is to move all the airflow jobs from EDF(persistent cluster) to NGAP(Next Generation Analytics Platform – MAP Cluster).
Roles & Responsibility:
Client : Telia, Sweden
Project Name : Airflow Migration
The main objective of this project is to extract and load the CRC data into CDL and migrate all Talend jobs into Airflow pipelines.
Roles & Responsibility:
Project Name: Performance Management Backlog
The main objective of this project is to extract and load the GE Power equipment’s data into data lake.
Roles & Responsibility:
Client : PNC, United States of America
Project Name: EPK Metric Dashboard
The main objective of this project is to extract the mnemonics data from Zenoss database to EPK database for all the metric.
Roles & Roles and Responsibility:
Client : Common Wealth Bank of Australia, Australia
Project Name : Global Asset Management
The main objective of this project is to provide small work development (SWR) for the applications in GAM segment. The applications involved are SSRS, FTS Interface, Charles River and GAM Scheduler
Roles & Responsibility:
Bigdata Processing Engines : Hadoop - (Hive, HDFS, HBASE, Impala, SQOOP, YARN, AVRO) Spark - (Spark core, Spark SQL, Streaming) Streaming - Kafka
undefined