A seasoned Data Architect and Data Engineer with over 12 years of experience designing and implementing data platforms on hyperscalers like Azure and GCP. Skilled in utilizing cloud-native and open-source technologies to support leading automotive, manufacturing, and BFSI clients. Proficient in modern data architectures such as Lakehouse, DataMesh, and active metadata management, with a strong focus on data quality, automation, and integrity. Adept at designing metadata-driven solutions based on the Medallion architecture with integrated data governance. Additionally, possesses extensive knowledge of traditional dimensional modeling and data warehousing, along with expertise in Python/SQL-based analytics solutions. A proven leader, having successfully led cross-functional teams, driven digital transformation, and defined data roadmaps to deliver strategic value for clients.
Key Technologies worked:
Data architecture,DataBricks,DataFactory,SnowFlake, Azure, GCP, Data Lake, Delta Lakehouse, Data Governance,Python,NoSql
Key Responsibilities:
Key Technologies worked:
Azure Databricks,Azure DataFactory,Azure IOTHUB,Azure Event hub,Grafana,Stream analytics,PowerBI
Key Responsibilities:
Key Technologies worked:multi collinearity
ADLS Gen2, Azure SQL, IOT hub, Azure ML,ADB,SSIS,PowerBI,Grafana
Key Responsibilities:
Key Technologies worked:
SSIS,SQL,SSRS,Crystal Reports
Key Responsibilities:
Key Technologies worked:
SQL Development,Performance Optimization,Application development
Key Responsibilities:
SQL
ETL development
Data Analytics
Data Modeling
Real-time Analytics
Data integration
Data Lakehouse
1.Bosch Semantic Stack Data Layer(Oct 2023- till date):
Role: Data Architect/Senior Data Engineer
Project Description: Unified Data Platform Connective Automotive use cases in Bosch.
Tech stack: Azure, Databricks, ADF,ADLS Gen2,Azure Devops
2.CIMB-DataCore(Jan 2023-Nov 2023)
Role: Data Architect/Senior Data Engineer
Project Description: Real Time Lakehouse for Gold standard Master data consumption
Tech stack: GCP Databricks, Unity catalog, GCS, Confluent Kafka, Spark structured streaming.
3.VHIT-Lake house implementation(July 2022 – April 2023)
Role: Data Architect/Senior Data Engineer
Project Description: Real Time Lakehouse for Gold standard Master data consumption
Tech stack: Azure Databricks, Azure data factory
4.DFL-2.1(Nov 2021– June 2022)
Role: Data Architect/Senior Data Engineer
Project Description: Migration of existing Big data application from Synapse to Databricks and implementation of Data lake for processing data from 5 different applications.
Tech Stack: Azure Databricks , Azure data factory, ADLS gen2,Azure SQL server, Pyspark , IOTHUB, PowerBI , Grafana
5.DataNexus: Factory of Future(Feb 2020– June 2021 )
Role: Data Engineer Specialist
Project Description: Migration of existing Big data application from Synapse to Databricks and implementation of Data lake for processing data from 5 different applications.
Tech Stack:
Azure Databricks , Azure data factory, ADLS gen2,Azure SQL server, Azure functions. Pyspark
Mixing 2.0(Jan 2018 – Aug 2020)
Role: Data Engineer
Project Description: Implementation of recipe generation algorithm to shortlist best set of raw materials (from half a million possibilities) from stock warehouse for Push belt production. The objective to is to enhance the pushbelt quality and reduce scrap.
Tech Stack: Python, pandas, numpy , Flask, Postman, pycharm
Titan MIS/GHS Reporting(Jan 2016 – Jun 2016)
Role: Sql Developer/ETL Developer
Project Description: Development and maintenance of MIS system that hosts sales,inventory,after service reports of various business segments.
Supported and maintained existing report portals and data base ecosystem by debugging complex SQL queries and fixing bugs when needed
Tech Stack:SSIS, SSRS, SQL server, C#,ASP.net,Python