Summary
Overview
Work History
Education
Skills
Certification
Timeline
Generic
Rajat Dhawale

Rajat Dhawale

Nagpur

Summary

Highly skilled Data Engineer with strong expertise in Python, SQL, and scalable ETL/ELT pipeline development across AWS and GCP cloud platforms. Proven experience in building batch and streaming data pipelines, integrating Snowflake and cloud-native data services, and implementing CDC, SCD, and data quality frameworks. Adept at working across multiple client environments, collaborating with cross-functional teams, and delivering reliable, production-grade data solutions. Proficient in data warehousing, orchestration (Airflow), CI/CD automation, and cloud security/IAM, with foundational experience in machine learning (classification, regression) and GenAI integrations. Committed to delivering high-quality, scalable, and cost-efficient data platforms.

Overview

7
7
years of professional experience
1
1
Certification

Work History

Senior Data Engineer

Genpact
Bangalore (WFH), India
09.2023 - Current

Assisted in developing and maintaining data processing pipelines using Python and PySpark for batch and near-real-time ingestion across various data layers.

  • Supported in creating reliable data workflows on AWS services, including S3, Lambda, Glue, and Step Functions, for consistent data processing.
  • Contributed to the design and optimisation of Snowflake-based data platforms for structured data transformation.
  • Helped orchestrate workflows using Apache Airflow, enabling scheduling and automated data processing pipelines.
  • Conducted data validation and quality checks to ensure accurate analytical outputs.
  • Participated in supporting the development of data models for downstream analytics and reporting.
  • Aided in optimising performance through efficient query design and improved data processing strategies.
  • Worked with cross-functional teams to support data-driven decision-making.
  • Maintained documentation and runbooks to support pipeline reliability and issue resolution.

Senior Software Developer -

Persistent Systems
07.2021 - 09.2023

Assisted in building and maintaining Python-based data processing pipelines for ingesting, cleansing, and transforming healthcare datasets for search and analytical applications.

  • Supported the design and implementation of data transformation workflows to structure raw data into analysis-ready formats across multiple stages.
  • Helped create structured datasets to enable search functionality and facilitate downstream analytics use cases.
  • Participated in data migrations across PostgreSQL and MySQL, including schema mapping and data reconciliation.
  • Conducted data validation and quality checks to ensure reliable outputs.
  • Collaborated with team members and provided guidance to junior developers on Python development and data processing practices.
  • Maintained automation of data workflows using Python and scripting, with version-controlled processes.

Python Developer -

Go Digit General Insurance
10.2019 - 07.2021
Data Pipeline – Finance Domain Project.

Role: Backend Python Developer.|Applied machine learning.

  • Built Python-based backend pipelines to collect, clean, and store finance-related data from multiple web sources.
  • Implemented web scraping and automation using Scrapy, BeautifulSoup, Requests, and Selenium, handling background jobs, CAPTCHA challenges, and IP blocking scenarios.
  • Stored and managed processed data in PostgreSQL for analytics and model consumption.
  • Used AWS services (S3, Lambda, EC2) to support data storage, scheduled processing, and backend workflows.
  • Developed machine learning models using Python for business use cases: classification to categorise financial records (e.g. transaction type, risk category, and regression to predict numeric outcomes (e.g. amounts, trends, or scores.
  • Performed feature engineering, train/test splits, and model evaluation using standard Python ML libraries.
  • Automated jobs and model runs using Jenkins CI/CD and Python scripts.

Technologies: Python, AWS (S3, Lambda, EC2), PostgreSQL, Jenkins, Scrapy, Selenium, and Machine Learning (Classification, Regression).

Python Developer -

Shopkirana E Trading Pvt ltd
05.2019 - 10.2019
Customer Targeting and Retention Optimisation Project.
  • Built a customer targeting and retention system to boost sales using Python and Pandas for data analysis, summarisation, and visualisation.
  • Performed customer segmentation and behaviour analysis to identify high-potential customers and assign incremental sales targets.
  • Developed machine learning models for customer retention and propensity prediction, identifying customers likely to churn or to upsell.
  • Applied classification techniques to predict customer retention probability, and regression models to estimate expected purchase value.
  • Implemented feature engineering on customer demographics, purchase history, and engagement metrics.
  • Built a Django-based web application with a simple HTML/JavaScript UI to display customer insights, and model outputs.
  • Used SQL databases for storing customer data, model features, and prediction results.

Technologies: Python, Pandas, Machine Learning (Classification, Regression), SQL, Django, HTML, JavaScript, and Data Analysis.

Education

Bachelor of Engineering - Mechanical Engineering

Priyadarshani Institute of Engg & Technology
2017

H.S.C -

Maharashtra State Board
2013

SSC -

Maharashtra State Board
2011

Skills

Programming and analytical development
  • Python programming (7 years)
  • Object-oriented programming and modular design
  • Data-driven application development
Data processing and statistical analysis
  • Pandas and NumPy
  • Data cleaning, transformation, and exploratory data analysis
  • Statistical analysis and data modeling
Data visualization with Matplotlib
  • Analytical dashboards and visual insights
  • Web application development
Experience in web-based applications and GUI desktop applications
  • Database management and querying
SQL (PostgreSQL, SQL Server, Oracle, and Snowflake)
  • Data extraction and transformation for analytics
  • APIs and data integration
REST APIs using FastAPI and Flask
  • Integrating data sources into analytical workflows

Certification

  • AWS cloud practitioner
  • Google Cloud Certified Cloud Digital Leader
  • Microsoft Certified: Azure Fundamentals
  • Oracle Certification

Timeline

Senior Data Engineer

Genpact
09.2023 - Current

Senior Software Developer -

Persistent Systems
07.2021 - 09.2023

Python Developer -

Go Digit General Insurance
10.2019 - 07.2021

Python Developer -

Shopkirana E Trading Pvt ltd
05.2019 - 10.2019

Bachelor of Engineering - Mechanical Engineering

Priyadarshani Institute of Engg & Technology

H.S.C -

Maharashtra State Board

SSC -

Maharashtra State Board
Rajat Dhawale