Summary

Overview

Work History

Education

Skills

Accomplishments

References

Languages

Timeline

Supriya Chandru

Bengaluru

Summary

Goal-oriented IT Professional with 15+ years of experience in IT, specializing in Data Engineering. Designed and implemented data warehouses and scalable, high-performance data pipelines using Azure Databricks, Pyspark focusing on ETL and ELT workflows. Design end-to-end MLOps lifecycle implementation using Databricks, MLflow, and GitHub Action for automated model testing, evaluation, and deployment across environments.

Overview

years of professional experience

Work History

Databricks Architect

Apexon

Bangalore

09.2024 - 10.2025

Architected end-to-end ELT pipeline to migrate on-premise sales data into ADLS Gen2 using Azure Data Factory and Azure Databricks, designing a scalable Lakehouse architecture based on the Medallion (Bronze, Silver, Gold) framework.
Designed initial full load ingestion framework to load historical sales data into the Bronze layer, followed by monthly incremental loads using Auto Loader with schema evolution and checkpointing for reliable change data processing.
Built parameterized ADF pipelines to orchestrate Databricks Workflows and notebooks, enabling metadata-driven ingestion, dependency handling, incremental processing, and automated failure recovery.
Implemented optimized Delta Lake transformations with ACID compliance, partitioning, and performance tuning strategies to support high-volume monthly sales reporting and downstream analytics.
Established enterprise governance, security, and DevOps practices using Unity Catalog, RBAC, Managed Identities, Azure Key Vault, Azure DevOps CI/CD pipelines, and monitoring through Azure Monitor and Databricks job alerts.
•Designed and deployed a customer churn prediction model by implementing a complete MLOps lifecycle on Databricks, integrating MLflow for model tracking, versioning, and registry; utilized GitHub Actions to automate model training, validation, testing, evaluation, and seamless promotion across Dev, Staging, and Production environments.

Senior BigData & AI Engineer

Capco consulting services

Bangalore

11.2021 - 04.2024

•Developed and implemented a Linear Regression model to predict Airbnb customer pricing, performing data preprocessing (data cleaning, handling missing values, and normalization) and feature engineering (encoding categorical variables and deriving key features), followed by model training and evaluation; leveraged Databricks and MLflow for experiment tracking and model management, and implemented CI/CD pipelines using GitHub Actions for automated testing, evaluation, and deployment across Dev, Staging, and Production environments.
•Conducted data preprocessing, feature engineering, and model
evaluation to optimize model performance and accuracy
•Collaborated with cross-functional teams to understand business
requirements and translate them into actionable insights and
recommendations
•Designed, developed, and implemented technical data and analytic
solutions for HSBC Wholesale Banking Group using Azure cloud services
•Integration of solutions for continuous deployment
•Worked on the creation of scalable CI/CD pipelines

Senior Data Engineer

TATACONSULTANCYSERVICES

Bengaluru

02.2019 - 10.2021

Architected a robust ELT data pipeline using Azure Data Factory and Azure Data Lake Storage, implementing a medallion-style architecture (Landing, Curated, Processed layers) designed and built dimensional data models (Fact and Dimension tables) for sales and orders, optimized with full and incremental loading strategies, and delivered analytics-ready data into Azure SQL Warehouse.

Collaborated with cross-functional teams to define data modeling
standards and best practices, ensuring alignment with business
objectives and scalability of data solutions architecture
Developed Databricks notebooks for Dim tables creation using PySpark, SparkSQL for Fact tables
Implemented validation notebooks for data validation in different layers
AWS Glue for ETL pipeline creation and data import from PostgreSQL
Proposed, designed, and implemented data pipelines for ETL on AWS and
Azure

Spark Developer

L&T INFOTECH

Banglaore

08.2018 - 02.2019

Proficient in analyzing business requirements and specifications to develop and execute comprehensive test cases.
Conducted testing for standalone and web application features, ensuring full test coverage.
Extensive experience in developing test plans, creating test cases, and logging defects.
Skilled in performing GUI, Functional, Integration, System, Regression, E2E, and Database testing.
Proficient in regression testing, error documentation, and troubleshooting.
Monitored project progress and reported status against key project milestones.
Coordinated with business users to facilitate User Acceptance Testing (UAT).
Reviewed and validated test scenarios and test cases to ensure accuracy and completeness.
Assessed and maintained the Requirement Traceability Matrix to ensure alignment with business requirements.
Provided regular test status updates and detailed reports to senior management.

Senior Consultant

CAPGEMINI

Bangalore

07.2016 - 08.2018

•Analyzing big datasets, developed Spark API's, and converted Hive/SQLqueries into Spark transformations
•Imported tables from RDBMS to HDFS using Sqoop and utilized Kafka for real-time streaming
•Designed and proposed data pipelines for data ingestion into Hadoop &Data Lake

Hadoop Developer & Cloud Migration Engineer

Cognizant

Bengaluru

04.2010 - 06.2016

Migrated on-premise applications to AWS cloud
•Managed migration of almost 500 servers, including 8 business units
•Created VPC for complete control over the virtual networking environment
•Worked on CloudFormation scripts for creating VPC, subnets, route tables, and NACLS
•Implemented AWS Glue for ETL pipelines, importing data from
PostgreSQL using Crawler

Education

Master of Science - Software Engineering

BITS PILANI

Rajastan , PILANI

07-2012

Bachelor of Science - Information Science

R.L Jalappa Institute of Technology

Doddaballapur, Karnataka

07-2009

Skills

Machine learning
Azure Databricks
Azure DevOps
Github
Unity Catalog
PySpark
Pandas
Delta lake
Project management
Continuous integration/deployment

AWS glue ETL management
Machine learning integration
Data warehousing solutions
ETL processes
Databricks workflows
ML-Flow
MLOps
DLT
Python

Accomplishments

Architected and delivered end-to-end ELT pipelines ingesting data from Snowflake, APIs, and relational sources into Azure Data Lake Gen2 using Databricks, improving data availability and scalability.

Designed and implemented an enterprise Medallion Architecture (Bronze–Silver–Gold) with Delta Lake, enabling trusted, analytics-ready datasets for reporting and downstream AI/ML use cases.

Built high-performance transformation frameworks using PySpark, Spark SQL, Delta OPTIMIZE, Z-ORDER, and Liquid Clustering, reducing pipeline runtime by 35–50%.

Established strong data governance and security using Unity Catalog (RBAC, lineage, auditing) and standardized schema evolution and data quality validation across ingestion layers.

Enabled production-grade operations through CI/CD automation (GitHub/Azure DevOps), Databricks Workflows orchestration, monitoring, and cost-optimized cluster policies, improving reliability and lowering compute spend by 20%.

References

Available on requests

Languages

Kannada
English
Tamil
Telugu

Timeline

Databricks Architect

Apexon

09.2024 - 10.2025

Senior BigData & AI Engineer

Capco consulting services

11.2021 - 04.2024

Senior Data Engineer

TATACONSULTANCYSERVICES

02.2019 - 10.2021

Spark Developer

L&T INFOTECH

08.2018 - 02.2019

Senior Consultant

CAPGEMINI

07.2016 - 08.2018

Hadoop Developer & Cloud Migration Engineer

Cognizant

04.2010 - 06.2016

Master of Science - Software Engineering

BITS PILANI

Bachelor of Science - Information Science

R.L Jalappa Institute of Technology

Supriya Chandru

Summary

Overview

Work History

Databricks Architect

Senior BigData & AI Engineer

Senior Data Engineer

Spark Developer

Senior Consultant

Hadoop Developer & Cloud Migration Engineer

Education

Master of Science - Software Engineering

Bachelor of Science - Information Science

Skills

Accomplishments

References

Languages

Timeline

Databricks Architect

Senior BigData & AI Engineer

Senior Data Engineer

Spark Developer

Senior Consultant

Hadoop Developer & Cloud Migration Engineer

Master of Science - Software Engineering

Bachelor of Science - Information Science

Similar Profiles

Pelagia TambudzePelagia Tambudze

Pelagia TambudzePelagia Tambudze

Karan KapoorKaran Kapoor

David Todd JonesDavid Todd Jones