Summary
Overview
Work History
Education
Skills
Certification
Immediate Joiner
Personal Information
Timeline
Generic

Pratyush Sahay

Gurugram

Summary

Dynamic data professional with a proven track record at CODEJEE, excelling in ETL pipeline development using PySpark and AWS Glue. Enhanced query performance by 50% through data optimization and integrated real-time analytics in React. Skilled in SQL and Agile methodologies, demonstrating strong problem-solving and collaboration abilities.

Overview

3
3
years of professional experience
1
1
Certification

Work History

Associate

CODEJEE
09.2021 - 05.2024
  • ETL Pipeline Development with PySpark and AWS Glue: Automated data ingestion, transformation, and storage using PySpark on AWS Glue, including data cleansing, type conversions, and filtering.
  • Query Performance Optimization: Improved query performance by transforming data into Parquet format, reducing query execution time by 50% in AWS Athena.
  • Database Integration: Loaded processed data into Amazon RDS (PostgreSQL/MySQL) for structured storage and transactional processing, supporting seamless reporting and analysis.
  • Scalable Data Processing with AWS EMR: Built an automated ETL pipeline using PySpark on AWS EMR, applying distributed data transformations, and schema modifications for efficient processing.
  • Front-End Development with React: Developed a responsive SPA and interactive data dashboard in React, integrating real-time data analysis, with smooth navigation and responsive design.

Education

B.E - Production Engineering

Birla Institute of Technology (BIT)
India
01.2021

Higher Secondary Certificate (HSC) -

D.A.V Patna
01.2017

Senior Secondary Certificate (SSC) -

D.A.V Patna
01.2015

Skills

  • SQL
  • PySpark
  • Amazon Web Services
  • AWS Glue
  • AWS EMR
  • Amazon Redshift
  • Airflow
  • Angular
  • React
  • JavaScript
  • OOPs
  • ES6
  • Redux
  • Bootstrap
  • JIRA
  • Scrum practices
  • REST APIs
  • JSON

Certification

  • Data Analytics Specialization, Trainity, 2022, Training includes hands-on experience on various tools like Tableau, SQL, Excel.
  • Google Data Analytics Certificate, Google, 2022, Completed 8 courses developed by Google., https://drive.google.com/drive/folders/1T-Ej4d7CGP3FEQOAR01UIluXkHCupe73

Immediate Joiner

True

Personal Information

  • Willing To Relocate: True
  • Date of Birth: 08/31/99
  • Gender: Male
  • Nationality: Indian
  • Marital Status: Single

Timeline

Associate

CODEJEE
09.2021 - 05.2024

B.E - Production Engineering

Birla Institute of Technology (BIT)

Higher Secondary Certificate (HSC) -

D.A.V Patna

Senior Secondary Certificate (SSC) -

D.A.V Patna
Pratyush Sahay