ISHAN KUMAR

Professional
Kolkata

Summary

Results-oriented ETL and Data Engineer with over 7 years of hands-on experience delivering scalable data solutions across on-premise and cloud environments. Proven track record of driving ETL modernization using PySpark on AWS EMR, resulting in up to 40% faster batch performance. Adept at building reusable data ingestion frameworks, implementing CDC logic, and automating data validation workflows. Strong expertise in debugging complex data pipelines, optimizing SQL and PL/SQL queries, and delivering business-aligned reporting solutions via Tableau. Passionate about enabling teams through mentorship and technical leadership, with a foundation in QA engineering and deep skills in data validation, pipeline monitoring, and issue resolution using Snowflake, Splunk, and AWS services.

Overview

14 years of professional experience
7 years of post-secondary education

Work History

Data Engineer

NALCO
01.2018 - Current
  • Spearheaded migration of ETL pipelines from Informatica to PySpark on AWS EMR.
  • Designed reusable ingestion pipelines for structured and semi-structured data.
  • Implemented UNIX scripts for logging and monitoring to ensure pipeline visibility and compliance.
  • Enabled Tableau dashboards for executive stakeholders by delivering curated data marts.
  • Worked closely with QA to validate business rules and ensure SLA adherence.
  • Mentored junior developers on PySpark, SQL tuning, and ETL best practices.

Test Analyst

NALCO
08.2015 - 12.2017
  • Authored SQL test cases and validated pipeline integrity using Snowflake and Informatica DQ.
  • Executed 300+ test cases monthly across Tableau and AWS-based reporting systems.
  • Conducted log-based QA testing using Splunk for CDC validation.

ETL Developer

Jindal Group
06.2011 - 01.2015
  • Developed and maintained Informatica ETL mappings and supported end-to-end data mart operations.
  • Processed XML feeds and provided L3 support for production jobs.
  • Tuned data flows and authored JMeter test scripts for performance validation.
  • Designed CDC-based transformations for historical accuracy and real-time consistency.

Education

B.Tech - Metallurgical & Materials Engineering

MNIT JAIPUR
Jaipur, India
01.2007 - 01.2011

MBA - Marketing Management

Welingkar Institute
01.2018 - 01.2020

Advanced Certificate - Data Science

IIIT Bangalore
01.2022 - 01.2023

Skills

    PySpark

Zoom Id

663 260 3782

Current Project Highlight

Analytics Data Mart (ANM).

  • Designed and implemented a scalable PySpark-based data platform hosted on AWS EMR to unify marketing and sales analytics.
  • The system ingests and processes data from multiple sources including Salesforce and Google Analytics, delivering curated datasets for downstream business intelligence and real-time customer targeting.
  • Migrated legacy Informatica ETL workflows to PySpark on AWS EMR, improving scalability and reducing batch processing time by ~40%.
  • Built resilient, reusable ETL pipelines with automated recovery, alerting, and auditing mechanisms.
  • Developed ingestion frameworks capable of processing API responses and CSV, XML, Parquet, and plain-text files.
  • Automated sanity checks using UNIX scripting, covering data validation, record counts, and header/trailer checks.
  • Reviewed and optimized complex SQL and PL/SQL queries, reducing database load and improving performance.
  • Supported evolving business requirements by aligning ETL logic and transformation rules.
  • Delivered cleansed and enriched datasets to Tableau dashboards for campaign tracking and revenue insights.
  • Collaborated with analysts and stakeholders for requirements gathering and documentation.
  • Enabled automated delivery of daily data feeds to Sales and Marketing for real-time targeting.

Visa

  • I-129, valid until 30/09/2026
  • B1/B2, valid until 31/10/2028

Timeline

Advanced Certificate - Data Science

IIIT Bangalore
01.2022 - 01.2023

Data Engineer

NALCO
01.2018 - Current

MBA - Marketing Management

Welingkar Institute
01.2018 - 01.2020

Test Analyst

NALCO
08.2015 - 12.2017

ETL Developer

Jindal Group
06.2011 - 01.2015

B.Tech - Metallurgical & Materials Engineering

MNIT JAIPUR
01.2007 - 01.2011