Summary
Overview
Work History
Education
Skills
Certification
Extra-Curricular Activities
Languages
Work Availability
Work Preference
Timeline
Hi, I’m

Pawan Gosavi

Pune
Pawan Gosavi

Summary

Results-focused data professional with 5.5+ years of experience equipped for impactful contributions. Expertise in designing, building, and optimizing complex data products, data pipelines and ETL processes. Strong in SQL, Python, and cloud platforms, ensuring seamless data integration and robust data solutions. Known for excelling in collaborative environments, adapting swiftly to evolving needs, and driving team success.

Overview

6
years of professional experience
12
Certification

Work History

Estrel.ai

Consultant - Data Engineer
01.2025 - Current

Job overview

  • Client: Etihad Airways & Team: Data Hotshots [Data Engineering Team]
  • Monitored and maintained 128+ Azure Data Factory (ADF) pipelines, including 16 critical pipelines with 24/7 support roaster using PagerDuty.
  • Developed 5+ scalable data ingestion features using various Azure Services and Various API's.
  • Built and automated data ingestion using Microsoft Graph API from 6 SharePoint Online Lists into Synapse External Tables pointed to Parquet Tables as ADF Pipelines pushed through Azure DevOps.
  • Built, automated, and parsed data from 17 Excel files from 2 SharePoint Online sites into Delta Lake tables as Databricks Workflows.
  • Built, automated, parsed and processed 13 PDF Documents (e.g., aviation manuals, local regulations) using Gen AI tools (LlamaParse) for structured insights.
  • Tech Stack: Azure Data Factory, Azure Synapse Analytics, Azure Data Lake Storage Gen2, Azure Databricks, Azure SQL, LLM Models, Python, PySpark, Event Hubs, and Azure DevOps

Jindal Intellicom (JindalX)

Data Engineer - Manager
06.2023 - 01.2025

Job overview

  • Clients: PepsiCo, Compana, UTZ Snacks, WisdomAI
  • Part of Jindal's Elite "Wiz Team" and managed team of 3 Data Engineers.
  • Monitored and maintained 200+ Azure Data Factory (ADF) pipelines powering PepsiCo's Revolution 2.0 Dashboard for LATAM region.
  • Developed and replicated ADF pipelines for India and Egypt (AMESA) using LATAM pipeline architecture.
  • Implemented automated data governance policies including purging and archiving records older than 2 years.
  • Standardized character encoding to UTF-8 during ingestion to ensure global data consistency.
  • Built 67 Delta tables and 13 ADF pipelines from scratch for Compana's Pet brand Revenue Growth Tracker Dashboard.
  • Integrated machine learning regression outputs from Data Science team into Delta tables for business consumption.
  • Developed 32 Delta tables via 4 ADF pipelines for UTZ Snacks’ Competitive Impact Tracker and PPU Tracker (FMCG market).
  • Engineered core data infrastructure for WisdomAI’s multi-tier SaaS product (Gold, Silver, Bronze) across cross-markets.
  • Created 70+ Delta/Parquet tables, including ingestion of 20+ ML model outputs and FRED economic data via FRED API.
  • Built automated pipelines and implemented fiscal & 4-4-5 business calendar logic for advanced date aggregation (daily to yearly).
  • Leveraged External Hive Metastore using MS SQL Server for unified schema management across environments.

Systech Solutions Inc

Associate Software Engineer
07.2022 - 03.2023

Job overview

  • Clients: Tesla, Fox Sports
  • Monitored and maintained 200+ Apache Airflow DAGs and Informatica workflows for TeslaBI project; performed root-cause analysis and resolved job failures.
  • Implemented Slowly Changing Dimension (SCD) Type 2 logic in Informatica for historical data tracking.
  • Automated EOD failure alert reports via email using use of batch scripts to business stakeholders, improving transparency and SLA compliance.
  • Ingested live scoreboard data into Delta and Parquet tables using Azure Data Factory, Databricks, and REST APIs for Fox Sports.
  • Developed unit and integration tests across Databricks notebooks using unittest and pytest frameworks to ensure data reliability.
  • Conducted R&D on Chatbot integration using ChatGPT for natural language processing and querying of sports statistics.

Pillai's HOC

Data Science Researcher + Assistant Professor
06.2021 - 07.2022

Job overview

  • Conducted exploratory data analysis (EDA) on 5 years of admissions data to identify trends, seasonal patterns, and decision-making factors.
  • Developed predictive models to forecast potential applicants and designed data-driven targeted ad campaigns to improve enrollment efficiency.
  • Part of the Data Engineers team to manage an automated attendance system, Data Ingestion from Google Meet plugins into SQL databases using Python Notebooks during the COVID-19 remote learning phase.
  • Delivered hands-on lab sessions for Master’s students in Big Data Analytics, Machine Learning, Internet of Things (IoT), and Security in Computing.
  • M.Sc.IT - Created a online study material website for students based on WIX.
  • Mentored students on academic projects involving data science workflows, predictive modeling, and real-time system design.

Capgemini India

Associate Software Engineer
11.2019 - 02.2021

Job overview

  • Extracted and parsed metadata for 4,000+ movies and TV shows Reviews for Germany Based Movie Review Website using APIs like IMDbPY, RottenTomatoes-Python, and OMDb API.
  • Preprocessed text data using Natural Language Processing (NLP) techniques to build a clean and structured corpus for downstream machine learning models.
  • Stored processed datasets in Excel format in Azure Data Lake Storage Gen2 for scalable access and archival.
  • Designed and implemented ETL workflows to ingest enriched data into Azure CosmosDB for real-time consumption by recommendation engine.
  • Showcased small demo to 1 potential client for an AI Automated Audio / Subtitles Dubbing using Various methods of NLP.
  • Collaborated with Machine Learning team to ensure high-quality, structured input data for training recommendation algorithms.

Education

University of Mumbai
Mumbai

Master of Science from Information Technology
04-2021

University Overview

  • CGPA: 8.67 @ ~ 82.36% [AI Specialization]

University of Mumbai
Mumbai, India

Bachelor of Science from Computer Science
04-2019

University Overview

  • CGPA: 8.86 @ ~ 84.17% [Data Science Specialization]

Skills

  • Python / R
  • Apache Spark / PySpark
  • SQL / Advanced SQL
  • Azure Databricks
  • Azure Data Factory
  • Azure Synapse Analytics
  • Azure Data Lake Storage Gen2
  • Azure Stream Analytics / Evenhub
  • Azure SQL Database / MS SQL Server
  • ETL / Data Pipelines
  • Azure DevOps (CI/CD)
  • Data Modeling / Data Warehousing

Certification

  • Computer Basics & Application [08/10]
  • Computer Application & Desktop Publishing [08/10]
  • Maharashtra State - Certificate in Information Technology [06/14]
  • Hardware and Networking Professional [12/16]
  • Certificate of Data Analysis and Predictive Modelling [05/21]
  • Certificate of Managing Online Classes & Co- creating MOOCS [10/21]
  • Certificate of The G Suite for Education [11/21]
  • Tableau Desktop Certified Professional [06/22]
  • Databricks - Certification in Generative AI Fundamentals [11/24]
  • Databricks - Certification in Databricks Fundamentals [11/24]
  • Databricks - Certification in Azure Databricks Platform Architect [11/24]
  • Informatica - Data Engineering Foundation Certification [11/24]
  • Microsoft Certified: Fabric Data Engineer Associate [DP700] [Ongoing]

Extra-Curricular Activities

Extra-Curricular Activities
  • Participated and won Champion Trophy in E – Fest [Techno phoenix] in FY, & SY.
  • Participated in Pillar's Alegria 2021, Won the Thunder Board Tournament.
  • Love playing Chess, Traveling, I also like Trains, in free time do Standup Comedy.

Languages

English
Bilingual or Proficient (C2)
Hindi
Bilingual or Proficient (C2)
Marathi
Bilingual or Proficient (C2)
Availability
See my work availability
Not Available
Available
monday
tuesday
wednesday
thursday
friday
saturday
sunday
morning
afternoon
evening
swipe to browse

Work Preference

Work Type

Full Time

Work Location

On-SiteRemoteHybrid

Timeline

Consultant - Data Engineer
Estrel.ai
01.2025 - Current
Data Engineer - Manager
Jindal Intellicom (JindalX)
06.2023 - 01.2025
Associate Software Engineer
Systech Solutions Inc
07.2022 - 03.2023
Data Science Researcher + Assistant Professor
Pillai's HOC
06.2021 - 07.2022
Associate Software Engineer
Capgemini India
11.2019 - 02.2021
University of Mumbai
Master of Science from Information Technology
University of Mumbai
Bachelor of Science from Computer Science
Pawan Gosavi