
Vibhu Kumaran

Data Engineer
Chennai

Summary

Aspiring Data Engineer and QA Analyst with three years of experience, specializing in ETL processes using tools such as Azure Data Factory, AWS, and Microsoft Fabric. Proficient in PySpark and SQL for data manipulation, warehousing, and modeling, with a strong ability to transform complex datasets into actionable insights. Experienced in using SQL Server Management Studio for data transformations and Power BI for comprehensive data analysis. Committed to continuous learning and applying innovative solutions to enhance data-driven decision-making.

Overview

3 years of professional experience
3 Certifications

Work History

Junior Associate

KANINI
02.2025 - Current
  • Built end-to-end ETL/ELT pipelines in Microsoft Fabric using Medallion Architecture, supporting Data Ingestion, Source Connection, Transformation, Lakehouse/Warehouse loading, and Semantic Modeling.
  • Worked on PySpark and Spark-based transformations in tools like AWS (Glue, S3, Trino) and Azure Data Factory for large-scale data processing, using DataFrames, Delta Lake, and the Delta and Parquet file formats in distributed data environments.
  • Implemented Incremental loading, CDC logic, SCD Type 1/2 handling, and metadata-driven ETL pipelines for scalable data engineering solutions.
  • Performed Data Validation, Data Reconciliation, ETL testing, and QA/UAT support, and applied Data Governance controls to ensure data quality and integrity.
  • Managed end-to-end data engineering architecture in the Research and Healthcare domains, including OneLake storage, Pipeline Orchestration, and Performance Optimization.
  • Developed and optimized complex SQL queries, CTEs, Stored Procedures, and Joins for Healthcare Data Transformation, Validation, and Performance Tuning using SQL Server Management Studio.

Trainee Associate

KANINI
11.2023 - 01.2025
  • Engineered data solutions and data validations in Azure Synapse for the Healthcare domain project.
  • Executed data loading and transformation from raw to bronze, and bronze to silver tiers using PySpark, RDD, SSIS, and SQL.

Internship Trainee

KANINI
06.2023 - 10.2023
  • Assisted with Data Validation, Data Mapping, Reverse Engineering, and Data Modelling.
  • Assisted on various Data Visualization projects using Power BI and Tableau.

Education

Bachelor Of Technology - BTech - Computer Science

SRM University
Kattankulathur
05-2023

Skills

  • Programming Languages: SQL, Python, PySpark
  • Data Engineering: ETL/ELT Development, Data Pipeline Orchestration, Data Ingestion, Data Transformation, Incremental Loading, Metadata-Driven ETL, Change Data Capture (CDC), Source-to-Target Mapping (STTM)
  • Big Data & Processing: Apache Spark, PySpark, Spark DataFrames, Delta Lake / Delta Tables, Parquet, Distributed Data Processing
  • Cloud & Data Platforms: Microsoft Fabric, AWS, Azure Data Factory, SQL Server Management Studio (SSMS)
  • Data Architecture & Modeling: Medallion Architecture, Star Schema, Snowflake Schema, Semantic Modeling, SCD Type 1 & Type 2
  • SQL Development: Complex Joins, Common Table Expressions (CTEs), Stored Procedures, Views, Window Functions, Query Optimization

Certification

Microsoft Certified: Fabric Analytics Engineer Associate (DP-600)

Languages Known

  • English
  • Tamil
  • Hindi
  • Marathi