Summary
Overview
Work History
Education
Skills
Websites
Open Source Projects:
Accomplishments
Affiliations
Languages
Timeline
Generic

Manasi Kulkarni

Pune

Summary

Data Engineer with 2.1+ years of industry experience including intenship in Data Engineering and Data Warehousing, specializing in scalable infrastructure development and cloud migration. Skilled in both Azure and GCP ecosystems, delivering high-quality data solutions independently and as part of a team.

  • Led Teradata-to-Azure Synapse migrations with ETL workflows in Azure Data Factory and Databricks.
  • Experienced in big data processing using PySpark and Medallion Architecture for data lakes.
  • Proficient in real-time analytics with GCP, integrating Pub/Sub with BigQuery and Dataflow SQL for streaming data solutions.

Overview

3
3
years of professional experience

Work History

Programmer Analyst | Cloud Data Engineer

Cognizant Technology Solutions
Pune
03.2023 - Current

PepsiCo

  • Spearheaded data migration initiatives from Teradata to Azure Synapse, collaborating with cross-functional teams to ensure alignment with data governance standards and business goals.
  • Designed and deployed scalable ETL pipelines in Azure Data Factory, leveraging Data Flow and Databricks for complex transformations, schema mapping, and data type conversions to ensure data integrity.
  • Implemented Medallion Architecture for data lakes, enhancing data quality and supporting advanced analytics across projects.
  • Optimized large-scale data migration through incremental loading strategies, minimizing downtime, and enhancing performance.
  • Developed and maintained Databricks notebooks for data processing and analysis, using PySpark for data cleaning, transformation, and distributed processing.
  • Established robust error-handling procedures and automated monitoring for pipeline reliability, reducing operational downtime.
  • Automated existing manual work using Python, which reduced the time consumption of a month for a single application to 2 hours for each application.
  • Documented best practices for data migration and led knowledge-sharing sessions to foster collaboration.

Technologies: Azure Data Factory (ADF), Databricks, Python, PySpark, SSMS, Azure Storage Explorer, WinSCP, PuTTY.

GCP Data Engineer Intern

Cognizant Technology Solutions
Pune
02.2022 - 08.2022

Hands-on Experience with Google Cloud Platform (GCP).

  • Partitioned Tables: Implemented partitioning strategies to optimize query performance and manage data storage.
  • DataProc to BigQuery: Converted workflows from DataProc to BigQuery for efficient data processing and analysis.
  • Streaming Data with Dataflow SQL: Utilized Dataflow SQL to join streaming data, enabling real-time analytics and insights.
  • Pub/Sub to BigQuery Integration: Integrated Pub/Sub with BigQuery to streamline ingestion of real-time data for analysis.

Education

MTech - Data Science & Engineering

BITS Pilani | WILP
Pilani
04-2026

BTech - Electronics And Telecommunication Engineering

PES Modern College of Engineering, Pune
Pune
06-2022

Skills

  • Languages: Python, PySpark, SQL
  • Databases: MySQL, Teradata, SQL Server
  • ETL & Big Data: Azure Databricks, Azure Data Factory, Apache Hadoop, and Apache Spark
  • Cloud & Data Services: Azure Data Lake, Azure Synapse, GCP (BigQuery, DataProc, Dataflow, Pub/Sub), Azure DevOps
  • Other: Data Modeling and Analysis

Open Source Projects:

Data Ingestion Pipeline: Weather Data Acquisition

Technologies: Python, MySQL, API Integration (OpenWeatherMap)

· Developed a Python script to retrieve and parse weather data from the OpenWeatherMap API.

· Designed and implemented logic for API requests, handling JSON responses efficiently.

· Established a MySQL database connection, creating SQL queries for seamless data insertion.

Data Analysis Project: Customer Segmentation for Credit Card Marketing

Technologies: Python, Pandas, SQL

· Analyzed customer data to segment users based on credit card ownership for targeted marketing.

· Performed data cleaning and pre-processing to ensure high-quality data for analysis.

· Leveraged Pandas to identify trends and segment customers by financial profile.

Accomplishments

    Gen AI Hackathon winner at Cognizant

    Project: Multilingual AI Assistant

  • Developed an automatic language detection feature for seamless customer interactions across multiple languages.
  • Created an AI system that summarizes video meetings and knowledge-sharing sessions into concise text and audio summaries (e.g., 3 minutes).
  • Built a generative AI model to automatically generate code based on meeting discussions in Python, Java, and SQL.

Affiliations

Startup, company and community onboarding & Social Media Designer
Google Developer's Group (GDG) Pune

  • Facilitated onboarding processes for startups and community members, enhancing engagement and integration within the tech ecosystem.
  • Designed engaging social media content to promote events and initiatives, enhancing community outreach and participation.

Technical Production Team Lead
IWD’23 by Google Developers Group Pune & Women TechMakers, Pune

  • Led the technical production team for IWD’23, ensuring seamless execution and coordination of event activities.

Podcast Team Lead
IWD’23 by Google Developers Group Pune & Women TechMakers, Pune

  • Managed the podcast team, curating content and facilitating recordings to showcase women's contributions to technology and promote event objectives.

Languages

Marathi
First Language
English
Proficient (C2)
C2
Hindi
Proficient (C2)
C2

Timeline

Programmer Analyst | Cloud Data Engineer

Cognizant Technology Solutions
03.2023 - Current

GCP Data Engineer Intern

Cognizant Technology Solutions
02.2022 - 08.2022

MTech - Data Science & Engineering

BITS Pilani | WILP

BTech - Electronics And Telecommunication Engineering

PES Modern College of Engineering, Pune
Manasi Kulkarni