Summary
Overview
Work History
Education
Skills
Certification
ADDITIONAL HIGHLIGHTS
Timeline
Generic

Vaibhav Sarkate

Senior Business Analyst / Data Engineer
Hyderabad,TG

Summary

Data Engineer with 4+ years of experience designing, building, and maintaining scalable, production-grade data pipelines across enterprise environments. Strong hands-on expertise in Apache Spark (PySpark), SQL, Python, ETL automation, and cloud-ready architectures, with growing specialization in Databricks-style lakehouse patterns. Proven experience working with high-volume financial and ERP datasets, integrating relational systems via JDBC, implementing data quality and reconciliation frameworks, and optimizing performance for large-scale transformations.

Overview

4
4
years of professional experience
5
5
Certifications

Work History

Senior Business Analyst / Data Engineer

Ryan India
11.2021 - Current
  • Designed, developed, and maintained 150+ automated ETL pipelines for financial and operational reporting
  • Built SQL-based data ingestion and transformation workflows supporting large-scale SAP financial datasets (BSEG, BKPF)
  • Implemented data reconciliation and quality checks, improving reporting accuracy by 35%
  • Reduced manual data preparation effort by 60% through automation using SQL, Python, SSIS, and Alteryx
  • Optimized SQL queries, stored procedures, and views for performance and scalability
  • Delivered analytics-ready datasets enabling self-service BI and faster month-end close cycles
  • Collaborated with finance, risk, operations, and IT teams to translate business requirements into robust data solutions
  • Maintained detailed documentation of ETL logic, data mappings, and validation processes
  • Resolved complex data quality issues including malformed files, right-ragged data, and delimiter inconsistencies
  • Client Highlight – JPMorgan Chase & Co.
  • Built and optimized an SSIS-based Real-Time Load (RTL) pipeline to ingest and process high-volume SAP financial data into SQL Server for downstream analytics

Education

Post Graduate Diploma - Big Data Analytics (PG-DBDA)

CDAC Pune-Karad | SunBeam Institute of Information Technology
01-2021

Bachelor of Engineering - Electronics & Communication Engineering

Rashtrasant Tukadoji Maharaj Nagpur University
01-2019

Skills

Apache Spark (PySpark), Spark SQL

ETL Pipeline Design (Read → Transform → Write)

Performance optimization on distributed Data

Parquet, Structured Data Processing

SQL Server MySQL (JDBC integration)

Data Modelling, Data warehousing & Schema Design

Azure Databricks

Certification

Databricks Data Engineer Associate

ADDITIONAL HIGHLIGHTS

  • Strong foundation in enterprise data engineering workflows
  • Hands-on Spark development outside work through GitHub portfolio
  • Experience bridging legacy SQL systems with modern big data platforms

Timeline

Databricks Data Engineer Associate
01-2026
Microsoft Azure Fundamentals (AZ-900)
11-2025
Alteryx Designer Certification
11-2024

Senior Business Analyst / Data Engineer

Ryan India
11.2021 - Current

Post Graduate Diploma - Big Data Analytics (PG-DBDA)

CDAC Pune-Karad | SunBeam Institute of Information Technology

Bachelor of Engineering - Electronics & Communication Engineering

Rashtrasant Tukadoji Maharaj Nagpur University
Vaibhav SarkateSenior Business Analyst / Data Engineer