Summary
Overview
Work History
Education
Skills
Projects
Certification
Timeline
Generic

Pratibha Nimbolkar

Bangalore

Summary

Results-driven Data Engineer with over 1 year of experience in building and optimizing scalable ETL pipelines using Databricks, Apache Spark, and SQL. Skilled in transforming complex data into actionable insights through efficient, high-performance workflows. Strong expertise in end-to-end pipeline development, with a focus on automation, scalability, and data integrity.

Overview

1
1
year of professional experience
1
1
Certification

Work History

Associate Engineer

Diggibyte Technolgies Pvt Ltd
Bengaluru
03.2024 - Current
  • Develop and maintain data pipelines for extracting, transforming, and loading (ETL) data from various sources, such as Azure SQL Database, Oracle SQL, and ADLS, ensuring data accuracy and integrity.
  • Implemented and optimized workflows using Databricks Workflows and Azure Data Factory, automating data processing tasks for improved scalability and efficiency.
  • Apply data transformations and business logic to process and structure data using PySpark and SQL, ensuring consistency across multiple data layers (Bronze, Silver, Gold).
  • Monitor and troubleshoot data pipelines, ensuring data quality through profiling, validation, and cleansing, and supporting reliable data delivery for analytics and reporting.

Education

Master of Computer Application -

Vellore Institute of Technolgy
08-2024

Post Graduate Diploma of Computer Application -

Makhanlal Chaturvedi National University
12.2021

Bachelor of Science - in Computer Science

Motilal Vigyan Mahavidyalya
06.2020

Skills

  • Data Engineering Tools & Platforms
    Databricks, Apache Spark, Azure Synapse Analytics, Azure Data Lake Storage (ADLS)
  • Programming & Query Languages
    SQL, Python (including PySpark)
  • ETL & Data Integration
    ETL Pipeline Development, Data Transformation, Data Ingestion, Data Migration, Batch & Streaming Data
  • Cloud Platforms & Services
    Microsoft Azure (Azure Data Factory, Azure SQL Database, Azure Functions)
  • Orchestration & Workflow Tools
    Azure Data Factory, Databricks Workflows
  • Version Control & CI/CD
    Git, GitHub, CI/CD pipelines (Azure DevOps)

Projects

ETL pipeline migration from Azure Synapse to Databricks

  • Led the migration of ETL pipelines from Azure Synapse to Azure Databricks for product and finance data
  • Extracted data from Azure SQL Database, Oracle SQL Database, and ADLS, transformed it using PySpark, and loaded it into Delta Lake
  • Replaced Synapse pipeline orchestration with Databricks Workflows, enabling automated and streamlined processing
  • Applied complex business logic and data transformations, ensuring data accuracy and integrity across systems

Retail data pipeline development using Medallion architecture

  • Designed and implemented a data pipeline using Medallion Architecture to process raw JSON data with custom schema definitions
  • Ingested, cleaned, and validated data to ensure accuracy and consistency, applying data explosion techniques to flatten nested structures
  • Performed business rule-based transformations and loaded refined data into the target data warehouse to support analytics and reporting
  • Improved data quality through rigorous profiling, validation, and cleansing, enhancing the reliability of downstream insights

Certification

  • Databricks Certified Data Engineer Associate, Databricks, 2
  • Databricks Lakehouse Fundamental, Databricks
  • Learning Java11, LinkdIn
  • AWS Cloud Practitioner essentials, Coursera

Timeline

Associate Engineer

Diggibyte Technolgies Pvt Ltd
03.2024 - Current

Master of Computer Application -

Vellore Institute of Technolgy

Post Graduate Diploma of Computer Application -

Makhanlal Chaturvedi National University

Bachelor of Science - in Computer Science

Motilal Vigyan Mahavidyalya
Pratibha Nimbolkar