Summary
Overview
Work History
Education
Skills
CERTIFICATIONS
Timeline

RUSHIKESH GHATE

DATA ENGINEER
Pune

Summary

Data Engineer with 4.5+ years of hands-on experience in designing, building, and optimizing scalable, production-grade data pipelines on AWS. Strong expertise in Py-Spark, AWS Glue, S3, Redshift, and SQL, with proven success in ETL optimization, cloud migration, cost optimization, and data quality frameworks. Experienced in end-to-end pipeline ownership, performance tuning, and enabling analytics and BI teams with reliable, high-quality data.

Overview

6
6
years of professional experience

Work History

Data Engineer

SG Analytics
Pune
05.2021 - Current

Key Contributions and Impact

  • Designed and maintained production-grade data pipelines using Python, Py-Spark, AWS Glue, and S3, supporting batch processing, incremental loads, and BI consumption.
  • Contributed to the cloud migration of on-prem ETL workflows to AWS, improving scalability and reducing end-to-end execution time from 10 hours to 4–5 hours.
  • Optimized Py-Spark jobs through partitioning strategies, join optimization, memory tuning, and efficient file formats, significantly improving performance and cost efficiency.
  • Implemented data validation, anomaly detection, and quality checks, improving data reliability and consistency for downstream analytics.
  • Automated repetitive operational tasks using Python scripts, reducing 1-2 hours of daily manual effort, and minimizing human errors.
  • Designed analytics-ready data models and supported BI teams by delivering clean, well-structured datasets for Power BI and Tableau dashboards.
  • Collaborated with cross-functional teams (analytics, QA, and stakeholders) to ensure SLA-driven delivery and smooth production support.
  • Created and maintained technical documentation covering pipeline architecture, workflows, and troubleshooting guidelines.

Key Achievements

  • Reduced ETL runtime by 50–60% through Spark and workflow optimization.
  • Improved data freshness and reporting turnaround for business stakeholders.
  • Strengthened production stability through monitoring and validation frameworks.

FEA Analyst

Idemetrics Pvt Ltd
Pune
10.2017 - 09.2019
  • Performed data-driven structural analysis using ANSYS Workbench to evaluate design durability and compliance.
  • Conducted pre-processing and post-processing of large engineering datasets, generating detailed analytical reports.
  • Developed strong foundations in analytical thinking, data interpretation, and technical documentation, later applied to data engineering roles.

Education

PGP in Data Science & Engineering - Data Engineering

Great Learning, Pune
04.2001 -

Bachelor of Engineering - Mechanical

Bharati Vidyapeeth College of Engineering, Navi Mumbai
06-2015

Skills

Cloud & Big Data - AWS (S3, Glue, Athena, Lambda, IAM), ETL/ELT, data warehousing, data migration

Big Data & Processing: Py-Spark, Python, partitioning, job optimization, incremental loads

Programming - Python, MySQL / PL-SQL , Py-Spark, Spark SQL

Data Engineering: ETL Pipelines, Data Warehousing, Data Modeling, Data Lakes, Incremental Loads, Data Quality, Schema Validation

Orchestration & DevOps: AWS Step Functions, CI/CD, Git, GitHub Actions

Analytics & BI: Tableau, Power BI,KPI Modeling, Reporting Enablement

CERTIFICATIONS

  • AWS Cloud Practitioner Essentials (AWS)—Feb 2025
  • Building a Modern Data Warehouse - Data Engineering Bootcamp (Udemy)- Dec 2025
  • Microsoft Power BI: Practical Guide (Udemy)— 2024
  • Tableau 2020 for Data Science & Business Analytics (Udemy) — 2022

Timeline

Data Engineer - SG Analytics
05.2021 - Current
FEA Analyst - Idemetrics Pvt Ltd
10.2017 - 09.2019
Great Learning - PGP in Data Science & Engineering, Data Engineering
04.2001 -
Bharati Vidyapeeth College of Engineering - Bachelor of Engineering, Mechanical
RUSHIKESH GHATEDATA ENGINEER