Summary
Overview
Work History
Education
Skills
Software
Certification
Timeline
Generic

Harnoor Singh

Senior Data Engineer
Sector 1,Airoli,Navi Mumbai 400708

Summary

Experienced Data Engineer with almost 6 years of expertise in designing and implementing scalable data pipelines and applications. Proficient in Google Cloud-based applications and Big Data technologies like GCP, Dataflow, Pubsub, Dataproc,Airflow, CI/CD, Advanced SQL, HDFS, YARN, and Pyspark. Passionate about learning new concepts and helping businesses succeed. Skilled in developing scalable data solutions for large enterprise data from multiple sources, including structured and semi-structured data. Adept in collaborating with cross-functional teams and delivering end-to-end data solutions. Developed a data pipeline using Data Lake that led to a client revenue increase of 19%.

Overview

6
6
years of professional experience
3
3
years of post-secondary education
12
12
Certifications

Work History

Senior Data Engineer

Dunnhumby
Gurgaon
07.2021 - Current

Experience:

  • Worked in Retail and CPG domain to generate recommendations for products under different KPIs and B2B, B2C business models.
  • Worked with data science teams to develop and architect big data pipelines that solve key business problems.
  • Designed and developed batch and streaming ETL pipelines using cloud storage, Pub/Sub, cloud functions, Dataproc, Pyspark and BigQuery.
  • Built scalable data solutions for large enterprise data leveraging structured and semi-structured data from multiple sources.
  • Designed and developed data pipelines using Apache Airflow.
  • Implemented Continuous Integration and Continuous Deployment (CI/CD) using Octopus and gitlab CI/CD.

Associate Consultant Data Engineer

KPMG 
Bengaluru
03.2021 - 07.2021

Experience :

  • Developed solutions that enabled data scientists to exploit data, obtain insight, and augment data analysis techniques.
  • Created a denormalized BigQuery Schema for analytical and reporting requirements.
  • Loaded historical data to Cloud Storage using Hadoop utilities and loaded to BigQuery using BQ tools.


Senior Software Analyst

Capgemini Technology Services India Limited
Navi mumbai
08.2017 - 03.2021

Experience:

  • Worked as a Big Data Engineer in the Data Lake Team for an automobile client where the client wanted to store, process & manage the huge amount of daily data collected from various sources.
  • Developed Pyspark and Spark-SQL applications to extract, transform, and aggregate data from various file formats, uncovering customer usage patterns.
  • Experienced in performance tuning and optimization of Spark applications for setting the right batch interval time, correct level of parallelism, and query optimizer.
  • Proficient in data modelling and engineering tools like Hadoop, Spark, and Kafka.
  • Incorporated data quality into day-to-day work and delivery process.
  • Collaborated with cross-functional teams to create complex data processing pipelines to solve client's challenges.
  • Built multiple scalable data pipelines with end-to-end ETL and ELT processing for Data ingestion and transformation.
  • Developed multiple python libraries for interaction with different Cloud services.

Key Achievements:

  • Developed a data pipeline using Data Lake that led to a client revenue increase of 19%.
  • Solved an ETL issue while following Pyspark and Sql best practices that resulted in an insight that increased client’s customer base by 20%

Education

Bachelor of Computer Applications -

Guru Nanak Dev University, 7.8 CGPA
Amritsar,Punjab
05.2014 - 05.2017

Skills

Proficient in Python

undefined

Software

Hadoop Ecosystem ,HDFS, Yarn,PySpark, Scala,Spark-Sql

Python,Shell Scripting ,LinuxJupyter notebooks,Spyder,Pycharm,Anaconda

Gcp,Cloud Dataflow, Big Query, Pub/ Sub, Cloud Shell,Dataproc, Cloud Functions

Mysql, Oracle,Sql Server

ADF,AZURE Data Lake,Azure Blob storage

Jupyter notebooks,Spyder,Pycharm,Anaconda 

AWS Lambda,Kinesis Data Stream

Certification

Data Engineering with Google Cloud Professional Certificate from Coursera

Timeline

Senior Data Engineer

Dunnhumby
07.2021 - Current

Associate Consultant Data Engineer

KPMG 
03.2021 - 07.2021

Data Engineering with Google Cloud Professional Certificate from Coursera

07-2020

Oracle Autonomous Database Cloud 2019 Certified Specialist

07-2020

Oracle Cloud Infrastructure Developer 2020 Certified Associate

07-2020

Oracle Cloud Infrastructure Foundations 2020 Certified Associate

07-2020

Google Cloud Platform Big Data and Machine Learning Fundamentals

06-2020

AI Genie Certified From Capgemini AI Academy.

05-2020

Deep learning and neural networks Certification from Coursera

05-2020

Partcipation Certification from Global Data Science Hackathon from Capgemini

05-2020

Automation Engineer Practitioner Certification by Capgemini University

09-2019

Customer Delight award from Capgemini 

02-2019

Data Science Hackathon - FinTech(https://www.credential.net/hz3y6qln)

08-2018

Automation Foundation Level Certification by Capgemini University

06-2018

Senior Software Analyst

Capgemini Technology Services India Limited
08.2017 - 03.2021

Bachelor of Computer Applications -

Guru Nanak Dev University, 7.8 CGPA
05.2014 - 05.2017
Harnoor SinghSenior Data Engineer