Heeresh Raj

Noida

Summary

Dynamic Azure Data Engineer with a proven track record at Capgemini, excelling in data ingestion and pipeline development using PySpark and Spark SQL. Adept at mentoring teams and ensuring data quality, I thrive on transforming complex datasets into actionable insights while fostering collaboration and innovation in data engineering practices.

Overview

7 years of professional experience
1 Certification

Work History

Azure Data Engineer - Allianz Simplifi

Capgemini services
Noida
01.2025 - Current
  • Work as an Azure Data Engineer on the Allianz SimpliFi project.
  • Implement the ingestion of data from .zip archives (.dat and .ctl files) in Azure Data Lake Storage Gen2 accounts into various layers.
  • Write PySpark and Python code for data ingestion and the creation of temp views.
  • Use Spark SQL in Azure Synapse Workspace for table creation and business transformations.
  • Develop data pipelines to process and transform large datasets efficiently.
  • Ensure data quality by conducting regular audits and validation checks.
  • Mentor junior team members on best practices in data engineering methodologies.
  • Analyze user requirements, then design and develop ETL processes to load enterprise data into the Data Warehouse.
  • Prepare Power BI reports based on the analysis.

Data Engineer - National Australia Bank (NAB)

Capgemini services
Noida
07.2023 - 12.2024
  • Designed data pipelines for efficient data processing and storage.
  • Collaborated with teams to gather data requirements and specifications.
  • Maintained data quality by performing regular audits and validations.
  • Assisted in troubleshooting data issues across various platforms.
  • Documented technical processes and developed user guides for team use.
  • Provided technical mentorship to junior data engineers, guiding them on best practices and project execution.
  • Conducted data analysis using SQL and Python to derive insights and support decision-making processes.

Data Engineer - Jansen Pharma

Tata Consultancy Services (TCS)
Noida
08.2021 - 06.2023
  • Worked as a support data engineer on the Jansen Pharma project.
  • Analyzed user requirements, then designed and developed ETL processes to load enterprise data into the Data Warehouse.
  • Monitored data systems performance, identifying bottlenecks and implementing solutions to maintain system efficiency.
  • Collaborated with cross-functional teams to define data requirements.

Big Data Engineer - Johnson and Johnson (JNJ)

Tata Consultancy Services (TCS)
Noida
08.2019 - 07.2021
  • Worked as a developer and tester on a project for the pharmaceutical company Johnson and Johnson (JNJ): wrote Hive queries, modified shell scripts, prepared config and query files, and performed unit testing.
  • Created Change Requests (CRs) for migration between environments (Dev to QA and QA to Production), provided support, and performed job validation.
  • Wrote Hive queries in Hue for analysis work.
  • Used Jira to track user-story changes, create test plans, and run test executions in the respective sprint.
  • Used Confluence to create flow diagrams and keep project records.
  • Used Bitbucket to store code and other files as a backup.

Database Administrator - Eli Lilly

Tata Consultancy Services (TCS)
12.2018 - 07.2019
  • Worked on a project for the pharmaceutical company Eli Lilly, handling database administration activities such as installation, configuration, backup and recovery, database security, Data Transformation Services, database design, and query optimization.
  • Provided production support and troubleshooting.
  • Created jobs to monitor resources on various servers and email the necessary information when a threshold was reached.
  • Demonstrated technical skills across multiple database technologies and computing platforms.
  • Managed and executed daily operational jobs, including backups, index/statistics updates, and database restoration.

Education

Master of Computer Application - MCA

Aligarh Muslim University
Aligarh, Uttar Pradesh
07.2018

B.Sc (Hons.) - Computer Application (BCA)

Aligarh Muslim University
Aligarh, Uttar Pradesh
08.2015

Intermediate - 12th

Wisdom Public School
Aligarh, Uttar Pradesh
04.2012

High School - 10th

Gagan Public School
Aligarh, Uttar Pradesh
04.2010

Skills

  • SQL
  • Spark SQL
  • PySpark
  • Azure services
  • Databricks
  • Synapse Analytics workspace
  • Azure Data Factory
  • Data analysis
  • ETL process development
  • Cloud data storage
  • Technical documentation
  • Mentoring team members
  • Data migration
  • Data curation
  • Jira
  • Hue

Certification

  • Microsoft Certified: DP-203 (Azure Data Engineer Associate) (Certificate ID: 1552-6423)
  • Microsoft Certified: DP-900 (Azure Data Fundamentals) (Certificate ID: 1306-4799)
  • Databricks Certified: Academy Accreditation - Databricks Lakehouse Fundamentals (Certificate ID: 70710167)
  • Coursera Certification: Google Cloud Big Data and Machine Learning Fundamentals (Certification URL: https://www.coursera.org/account/accomplishments/verify/QLXLMZMLG86G)
  • Udemy Certification: Azure Databricks and Spark for data engineers (PySpark/SQL) (Certificate ID: UC-eb27fad2-db49-4be8-84fb-c8642ff81d3b)
  • Udemy Certification: Best hands-on big data practices with PySpark and Spark tuning (Certificate URL: https://www.udemy.com/certificate/UC-5c717197-9735-414a-96ba-6d91cc42075c/)
  • Udemy Certification: Mastering SQL for Data Science (Certificate ID: UC-56821c61-c0e9-4334-a1bd-95df3ece5614)

Languages

English, Hindi

Awards

  • Received multiple appreciations from stakeholders for project work at Capgemini.
  • Received the Customer WOW Delight Award for overall performance at Capgemini.
  • Received the Service Commitment Award for completing three years at TCS, appreciation from customers for the work done, and the Fresco Play Miles Award for consecutive learning.

Languages

Hindi
First Language
English
Intermediate (B1)
