Summary
Skills
Certification
Overview
Work History
Education
Timeline
Work Availability
Hi, I’m

Mridu Singh

Senior Data Engineer
Noida
Mridu Singh

Summary

Senior Data Engineer with 10+ years of experience building scalable data platforms using Databricks, Snowflake, and Informatica. Skilled in developing PySpark-based data pipelines and Delta Lake architectures, and modernizing legacy ETL systems into cloud-native solutions.

Strong expertise in performance optimization, data modeling, and pipeline orchestration, supporting large-scale analytics in the insurance domain.

Hands-on experience with GenAI, including a RAG-based POC on Databricks for natural language data querying.

Skills

Databricks: PySpark, Delta Lake, Workflows, Unity Catalog

Cloud & Warehousing: Snowflake, AWS (S3, Glue)

ETL Tools: Informatica PowerCenter, IICS/CDI, SSIS

Programming: Python, SQL (Advanced), PL/SQL

Data Engineering: Data Pipelines, Medallion Architecture, Data Modeling

Optimization: Query Tuning, Partitioning, Caching

GenAI: RAG, LLM Integration, Vector-Based Retrieval

Certification

Databricks Certified Data Engineer Associate

Overview

10
years of professional experience
4
Certificates

Work History

LTM (Former LTIMINDTREE)

Senior Data Engineer
10.2024 - Current

Job overview

  • Architected Databricks Lakehouse pipelines using Delta Lake, enabling scalable processing of high-volume insurance data from AWS S3.
  • Built distributed PySpark pipelines for transforming structured and semi-structured data into ACID-compliant Delta tables.
  • Designed Medallion Architecture (Bronze → Silver → Gold) with incremental processing for reliable and optimized data consumption.
  • Improved performance by 25–30% using partitioning, caching, broadcast joins, and Spark optimization techniques.
  • Led migration from Informatica/SSIS to Databricks + Snowflake, redesigning ETL into scalable cloud-native pipelines.
  • Implemented data quality frameworks, PII masking, and governance controls, ensuring >99.9% data accuracy.
  • Built a GenAI-based RAG POC on Databricks, enabling natural language querying of enterprise datasets using LLMs and vector-based retrieval.

LTM (Formerly LTIMINDTREE)

Data Engineer
11.2021 - 09.2024

Job overview

  • Designed and developed scalable ETL pipelines using Informatica IICS/CDI and Snowflake, enabling efficient ingestion and transformation of enterprise insurance data.
  • Implemented Snowflake data warehousing solutions, including schema design, data modeling, and query optimization using clustering techniques.
  • Improved ETL performance by over 30% through pushdown optimization, efficient SQL design, and parameterized workflows.
  • Developed business-ready data marts in Snowflake for underwriting, claims, and fraud analytics.
  • Implemented data validation, reconciliation, and audit frameworks, ensuring data accuracy and consistency across systems.

Birlasoft

Senior ETL Developer
11.2018 - 11.2021

Job overview

  • Developed and maintained ETL workflows using Informatica PowerCenter for large-scale insurance datasets.
  • Built and optimized SQL/PLSQL logic in Oracle for complex data transformations.
  • Improved ETL performance by ~35% using query optimization, indexing, and workflow tuning.
  • Implemented reusable components and parameterized workflows to enhance scalability.
  • Performed data validation and reconciliation, ensuring >99.8% data accuracy.

Cognizant Technologies Solutions

Programmer Analyst
07.2014 - 01.2017

Job overview

  • Developed ETL workflows using Informatica PowerCenter for insurance data processing.
  • Wrote and optimized SQL (Oracle/SQL Server) for data transformation and validation.
  • Supported batch processing, data integration, and production issue resolution.

Education

Krishna Engineering College
Ghaziabad, India

Bachelor of Technology from Computer Science
06-2014

Timeline

Databricks Certified Data Engineer Associate

02-2026

Databricks Certified Generative AI Engineer Associate

02-2026

Senior Data Engineer

LTM (Former LTIMINDTREE)
10.2024 - Current

Cloud Data Integration for PowerCenter Developers – Foundation Certificate

07-2024

Microsoft Certified: Fabric Analytics Engineer Associate

06-2024

Data Engineer

LTM (Formerly LTIMINDTREE)
11.2021 - 09.2024

Senior ETL Developer

Birlasoft
11.2018 - 11.2021

Programmer Analyst

Cognizant Technologies Solutions
07.2014 - 01.2017

Krishna Engineering College

Bachelor of Technology from Computer Science
Availability
See my work availability
Not Available
Available
monday
tuesday
wednesday
thursday
friday
saturday
sunday
morning
afternoon
evening
swipe to browse
Mridu SinghSenior Data Engineer