Vaibhav Sarkate

Senior Business Analyst / Data Engineer

Hyderabad, TG

Summary

Data Engineer with 4+ years of experience designing, building, and maintaining scalable, production-grade data pipelines across enterprise environments. Strong hands-on expertise in Apache Spark (PySpark), SQL, Python, ETL automation, and cloud-ready architectures, with growing specialization in Databricks-style lakehouse patterns. Proven experience working with high-volume financial and ERP datasets, integrating relational systems via JDBC, implementing data quality and reconciliation frameworks, and optimizing performance for large-scale transformations.

Work History

Senior Business Analyst / Data Engineer

Ryan India

11-2021 - Current

Designed, developed, and maintained 150+ automated ETL pipelines for financial and operational reporting
Built SQL-based data ingestion and transformation workflows supporting large-scale SAP financial datasets (BSEG, BKPF)
Implemented data reconciliation and quality checks, improving reporting accuracy by 35%
Reduced manual data preparation effort by 60% through automation using SQL, Python, SSIS, and Alteryx
Optimized SQL queries, stored procedures, and views for performance and scalability
Delivered analytics-ready datasets enabling self-service BI and faster month-end close cycles
Collaborated with finance, risk, operations, and IT teams to translate business requirements into robust data solutions
Maintained detailed documentation of ETL logic, data mappings, and validation processes
Resolved complex data quality issues including malformed files, ragged-right data, and delimiter inconsistencies
Client Highlight – JPMorgan Chase & Co.
Built and optimized an SSIS-based Real-Time Load (RTL) pipeline to ingest and process high-volume SAP financial data into SQL Server for downstream analytics

Education

Post Graduate Diploma - Big Data Analytics (PG-DBDA)

CDAC Pune-Karad | SunBeam Institute of Information Technology

01-2021

Bachelor of Engineering - Electronics & Communication Engineering

Rashtrasant Tukadoji Maharaj Nagpur University

01-2019

Skills

Apache Spark (PySpark), Spark SQL

ETL Pipeline Design (Read → Transform → Write)

Performance optimization on distributed data

Parquet, Structured Data Processing

SQL Server, MySQL (JDBC integration)

Data Modelling, Data Warehousing & Schema Design

Azure Databricks

Certification

Databricks Data Engineer Associate

Additional Highlights

Strong foundation in enterprise data engineering workflows
Hands-on Spark development outside work, demonstrated through a GitHub portfolio
Experience bridging legacy SQL systems with modern big data platforms

Timeline

Databricks Data Engineer Associate

01-2026

Microsoft Azure Fundamentals (AZ-900)

11-2025

Alteryx Designer Certification

11-2024
