NEERAJ KUMAR

New Delhi, DL

Summary

Senior Data Engineer with 7+ years of industry experience, including 4 years at Accenture and over 3 years at ZS. Experienced in developing data-intensive applications and solving complex architectural and scalability challenges in the healthcare and finance sectors. Proficient in building scalable ETL pipelines using tools such as Pentaho and EDLS (AWS Glue), and adept at migrating data from on-premises systems to the AWS cloud. Holds the AWS Certified Solutions Architect - Associate (SAA) certification and has hands-on experience implementing and optimizing AWS services according to industry best practices.

Overview

8 years of professional experience
1 Certification

Work History

Senior Data Engineer

ZS Associates
04.2021 - Current

Project: DSS Dataiku Platform Enablement and Optimization

  • Collaborated with the ETL operations team to develop strategies and conceptualize best practices for the DSS Dataiku platform.
  • Reengineered existing ETL workflows to improve performance by identifying bottlenecks and optimizing code accordingly.
  • Led a team of six Data Engineers, providing mentorship and guidance.
  • Successfully implemented DSS ETL best practices across 150+ projects.
  • Optimized DSS recipes, including SQL, Python, and visual recipes (e.g., Prepare, Sync).
  • Assisted multiple teams in creating, archiving, optimizing, and setting up Dataiku projects, ensuring efficient workflow management and project execution.
  • Received client recognition for optimizing multiple ETL projects and deploying them on the Auto node platform.


Senior Data Engineer

ZS Associates
04.2021 - Current

Project: SDoH (Social Determinants of Health)

  • Built a centralized SDoH Data Lake, enabling enterprise-wide access and supporting data-driven decision making for key business stakeholders.
  • Collaborated cross-functionally with RWE and Commercial colleagues to identify and prioritize public SDoH data sources to include in the RWD Data Mart.
  • Conducted data profiling and data modeling of 25 publicly available SDoH datasets, then grouped them into 18 SDoH categories in the data lake.
  • Utilized Python web scraping techniques to extract data from publicly available websites, streamlining data collection and enhancing data accessibility for analysis.
  • Led a team of three Data Engineers, providing mentorship and guidance.

Senior Data Engineer

ZS Associates
04.2021 - Current

Project: CTDI (Competitive Trial Design Intelligence)

  • Worked with business stakeholders to gather, standardize, harmonize, and visualize clinical trial information from unstructured data sources to aid trial design.
  • Developed and implemented a data ingestion and processing pipeline for unstructured data using Python, with outputs published to Elasticsearch (NoSQL) for web application integration.
  • Configured AWS services including EC2, IAM, API Gateway, Lambda, and RDS to establish a data pipeline. Developed over 30 APIs in Python using AWS Lambda and API Gateway, while leading and mentoring a team of four Data Engineers.
  • Collaborated cross-functionally with front-end, data science, and data engineering teams.

Data Engineer

Accenture Solutions Private
02.2017 - 03.2021

Project: APTP (Accenture Post Trade Processing)

  • APTP is utilized to enhance post-trade processing, specifically in the areas of Trade Confirmation, Clearing, Settlement, Reconciliation, and Risk Management.
  • Developed a data warehouse integrating data from external vendors and corporate sources, including Oracle 10g, flat files, Microsoft Excel files, and XML files.
  • Worked with business users, the analytics team, and data architects to identify business requirements, and developed the design document for the ETL flow.
  • Resolved performance issues encountered in production.
  • Migrated an on-premises MS SQL database to Amazon RDS using AWS DMS and SCT.

Education

B.Tech. - Computer Science and Engineering

Maharaja Surajmal Institute of Technology (GGSIPU)
04.2021

12th Standard

CBSE
01.2012

10th Standard

CBSE
01.2010

Skills

  • ETL development
  • Python Programming
  • SQL
  • Shell Scripting
  • Data Warehousing
  • Data Migration
  • NoSQL (Elasticsearch)
  • Red Hat Linux
  • Data Analysis
  • AWS Architecting
  • Snowflake
  • AWS Development
  • Data Pipeline Design
  • Data Modeling
  • Teamwork and Collaboration
  • Problem-solving abilities
  • SVN code versioning

Languages

English

Certification

AWS Certified Solutions Architect - Associate (SAA-C03)

Elasticsearch (Udemy)

Python (Coursera)

Dataiku Core Designer and Advanced Designer
