Bhupendra Baraiya

Mumbai

Summary

Data Engineering and Data Platform Architect with 12+ years of
experience in building and modernizing enterprise data pipelines
and platforms on AWS and Snowflake. Proven expertise as a hands-on solution architect and people leader, adept at managing teams
and mentoring talent while driving platform modernization
initiatives. Specializes in data ingestion techniques, including CDC
and streaming, as well as ELT/ETL automation, orchestration, and
infrastructure-as-code practices. Proficient in Python, SQL, Kafka,
Terraform, Airflow, and Snowflake performance patterns, with a
strong track record of collaborating with engineering and business
stakeholders to translate complex requirements into scalable,
testable data products.

Overview

14 years of professional experience
1 Certification

Work History

Solution Architect / Lead Data Engineer

phData
09.2025 - Current
  • Led discovery and solution design to modernize a legacy SQL Server + Data Vault-style warehouse into a scalable dimensional model on Snowflake, aligned to a medallion architecture (bronze/silver/gold).
  • Defined the end-to-end delivery plan from discovery through production rollout and hypercare; broke down epics/stories, created sprint plans, and owned execution governance.
  • Built transformation pipelines in Coalesce (client-preferred) to ingest from SQL Server and JSON-based sources, generating standardized Snowflake SQL and reusable patterns.
  • Owned source-to-target mapping specifications for each replacement object (including cases where one target replaces multiple legacy objects); facilitated walkthroughs and sign-offs with client stakeholders.
  • Established a parallel-run validation approach: reconciled legacy vs. new outputs, logged gaps, drove root-cause analysis, and delivered fixes to meet business expectations before cutover.
  • Managed 2 direct-report engineers for day-to-day delivery (standups, reviews) and mentored an additional 5 engineers on milestones, learning paths, and career progression.
  • Client-facing

Technical Architect

Tech Mahindra
05.2022 - 09.2025
  • Developed Python utilities to read JSON configuration and generate automated shell scripts for Docker builds and deployments to AWS ECR.
  • Implemented real-time reconciliation between SQL Server and Snowflake by concurrently invoking multiple AWS Lambda functions using Python asyncio for proactive pipeline monitoring.
  • Led Agile delivery and day-to-day client coordination, ensuring predictable sprint execution and on-time releases.
  • Architected a unified, configuration-driven ingestion platform using AWS Glue (Spark) + Confluent Kafka Connect to orchestrate multiple ingestion patterns (Kafka->S3->Iceberg, SQL->S3->Iceberg, and custom transformations).
  • Built infrastructure-as-code deployment pipelines with Terraform integrated into GitHub Actions and Azure DevOps for consistent environments.
  • Built Dockerized Debezium CDC pipelines on AWS MSK/ECS and Confluent Kafka to replicate data in near real time into an S3 data lake, RDS, and Snowflake.
  • Implemented dbt transformation models on Snowflake and basic dbt tests (schema/null checks) to improve data quality and reliability.
  • Orchestrated end-to-end AWS workflows using Airflow DAGs/APIs, handling dependencies, retries, and automated failure notifications.
  • Client sites

Lead Consultant

Xebia IT Architects
04.2021 - 05.2022
  • Designed integration/ETL flows (Workato recipes) across SaaS platforms such as Box and Salesforce for near real-time uploads and processing.
  • Built Python data validation pipelines using pandas and openpyxl to cleanse Excel-based datasets, enforce schema rules, and flag invalid records.
  • Developed Python-based AWS Lambda APIs (Requests, Simple Salesforce) to ingest validated datasets into Salesforce CRM with quality checks.

Senior Consultant

Deloitte Consulting India Pvt. Ltd.
08.2018 - 03.2021
  • Designed AWS-based ingestion patterns using Glue, Lambda, API Gateway, DMS, and PySpark to land data from APIs, Oracle, and FTP/SFTP into S3.
  • Built data lineage and metadata services in Python/Flask, exposing parameter-driven APIs for UI visualization.
  • Contributed to a data marketplace platform: built Node.js APIs, managed deployments on Heroku, and integrated authentication-enabled APIs using AWS services.

Senior Software Engineer II

Continuum Managed Solutions Pvt. Ltd.
03.2014 - 07.2018
  • Designed and implemented a ticketing data warehouse using Toad Data Modeler, supporting snowflake-schema and star-schema designs.
  • Developed Talend ETL jobs to ingest APIs/JSON/XML/SQL Server data into dimensional (fact/dim) models.
  • Built reusable Talend components for error handling, logging, and Snowflake integration.
  • Created Tableau dashboards and JasperSoft reports to improve KPI visibility; increased service delivery success rate from 70% to 97%.

Software Developer

Parekh Integrated Services India Pvt. Ltd.
07.2012 - 03.2014
  • Designed tables and functions in PostgreSQL to support operational and reporting workloads.
  • Scheduled Linux jobs and built Talend flows to move data between PostgreSQL and SQL Server.
  • Built and configured Crystal Reports and SQL Server 2008 reports for invoice printing.

Education

MCA

Sikkim Manipal University (SMU)
Mumbai
01.2014

Diploma of Higher Education - Software Engineering (GNIIT)

NIIT
Mumbai
01.2010

B.Com

Nagindas Khandwala College of Arts and Commerce
Mumbai
01.2010

Skills

  • Core Data Engineering: Python, SQL, AWS, Kafka, Terraform
  • Snowflake (Hands-on): SQL transformations (ELT), RBAC (roles/privileges), Virtual Warehouses (sizing, scaling, concurrency), Time Travel & Cloning, VARIANT/JSON
  • AWS: S3, Lambda, Glue (Spark), ECS, MSK, API Gateway
  • Streaming / CDC: Kafka, Kafka Connect, Debezium, Confluent platform
  • Orchestration / DevOps: Airflow, Terraform (IaC), Docker, GitHub Actions, Azure DevOps
  • Data Lake / Table Formats: Apache Iceberg
  • Tools: Coalesce, dbt, Talend, Workato
  • Reporting (supporting): Tableau, JasperSoft
  • Leadership / Delivery: Sprint planning, standups, stakeholder management, architecture documentation, mentoring
  • Performance optimization
  • Enterprise architecture design

Certification

Snowflake SnowPro Core Certification (Nov 2025)

Languages

English (Advanced)
Hindi (Advanced)
Gujarati (Proficient)
