Dnyanda Prabhune

Pune

Summary

Data engineer with three years of backend development experience building high-performance systems. Specializes in developing scalable data pipelines with Java, Spark, and Kafka that transform raw data into analytics-ready datasets. Proficient across the modern data stack, including Databricks, Snowflake, and Airflow, with hands-on experience orchestrating end-to-end data workflows.

Overview

3 years of professional experience

Work History

Senior System Associate

Infosys Ltd
Pune
12.2021 - 04.2025
  • Created backend solutions in Java for distributed data pipelines, ensuring efficient and rapid data processing.
  • Optimized SQL queries, relational database schemas, and indexing strategies to enhance query performance by 30%.
  • Implemented data ingestion and transformation logic to enhance data flow.
  • Applied code optimization techniques to enhance data workflow efficiency.

Skills

  • Programming and querying: Python, SQL
  • Data engineering and ETL: Airflow, dbt, NiFi, Spark, AWS Glue
  • Streaming and messaging: Kafka, Kafka Connect
  • Databases and warehousing: Snowflake, Redshift, PostgreSQL, MySQL
  • Cloud platforms: AWS (S3, EC2, IAM, Glue, Lambda, CloudWatch), GCP (BigQuery, Cloud Storage)
  • Data modeling and orchestration: star/snowflake schema, dimensional modeling, DAG scheduling
  • DevOps and automation: Git, Docker, CI/CD
  • Visualization: Power BI, Tableau

Projects

Real-time stock data streaming pipeline | Kafka, Python, AWS S3, Snowflake

  • Developed a Kafka-based data streaming solution using Python, automating the flow of stock data from producer to consumer, and writing streaming records to AWS S3 in JSON format
  • Automated ingestion and transformation with Python and Snowflake, enabling near real-time analytics
  • Secured the pipeline with IAM roles and added logging to support reliable, scalable operation

E-commerce data pipeline | Spark (PySpark), Databricks, Delta Lake

  • Designed an ETL pipeline in PySpark on Databricks to ingest, clean, and transform multi-source e-commerce datasets
  • Produced analytics-ready Delta/Parquet tables for sales trends, customer segmentation, and top-product insights
  • Optimized workflows for large-scale data, supporting future scalability and BI integration

Certifications

  • Databricks Lakehouse Fundamentals, Databricks Academy, 2025
  • Google Data Analytics Certificate, Google, 2024
  • Microsoft Certified: Power BI Data Analyst Associate, Microsoft, 2024
  • Snowflake data warehousing, 2025
  • Apache Kafka, 2025
  • Data engineering (aligned with Microsoft DP-203), 2024

Education

  • Master of Science — Savitribai Phule Pune University, Pune (Sep 2022)
