Experienced IT professional with 14+ years in Azure Data Engineering and Analytics specializing in building scalable data platforms leveraging Azure Databricks, Delta Lake, and native Azure Services. Proficient in data ingestion, transformation, and analytics using PySpark, Azure Data Factory (ADF), and Azure Synapse Analytics. Skilled in ETL and ELT Architecture, Delta Lake Medallion Architecture (Bronze, Silver, Gold layers), incremental loading strategies, and performance optimization with advanced PySpark transformations. Adept at developing interactive Power BI reports, ensuring secure data processing with Azure Key Vault, Managed Identities, and Azure AD. Strong expertise in Database design, Dimensional Data Modelling, and Workflow orchestration, enabling actionable insights through secure, reliable, and efficient data solutions.
Data Engineer at Mesmerise Group UK through Globalization HR Solutions India Private Limited
Methodologies: Agile, Scrum
Azure Data Engineering with Databricks Experience:
and timestamp conversions.
Azure Data Engineering with Native Services Experience:
Database Design and Modelling Experience:
Power BI Development and Analytics Experience:
Full Stack Development Experience:
Collaboration and Documentation:
Client Name: Carbonaires
Project Name: Carbonaires Data Platform
Duration: (Aug 2023 – Sept 2024)
Role: Data Engineer
Technologies Used: Azure Databricks, PySpark, Delta Lake, Azure SQL, Azure Data factory(ADF), ETL, Data Lake (ADLS), Azure Key-Vault, Managed Identity, Azure AD, Azure DevOps (git & CICD), Power BI.
Description: Carbonaires is an ESG-driven carbon asset management company that offers investors exposure to carbon credits, essential for achieving carbon neutrality and net-zero goals. Their business model revolves around partnering with carbon credit project developers, providing upfront funding in exchange for a share of future carbon credits produced. Carbonaires sell these carbon credits to corporations seeking to offset their emissions, optimizing value by purchasing high-integrity credits at low prices and selling them at higher rates. To enhance transparency and project valuation, they aim to collaborate with Mesmerise to build a data platform for improved valuation and risk management.
Client Name: Mesmerise Group
Project Name: Gatherings Data Platform & Analytics
Duration: (Nov 2022 – Aug 2023)
Role: Data Engineer and Analyst
Technologies Used: Azure Event Hub, Stream Analytics, Azure SQL, Azure Data factory(ADF), ETL, Azure Databricks, PySpark, Data Lake (ADLS), Azure Key Vault, Managed Identity, Azure AD, Azure DevOps (git & CICD), Power BI
Description: The Gatherings VR Application, developed by Mesmerise Group, is a virtual reality (VR) platform that enables people to meet and interact in an immersive virtual environment. It serves as a digital space where users can gather for virtual meetings, conferences, or social events, providing an engaging alternative to traditional video calls. Through this platform, participants can experience a sense of presence and connection, enhancing collaboration and interaction in a way that mimics real-life gatherings. To support this application, the Gatherings VR Data Platform project focuses on data ingestion, transformation, and analytics, enabling insights into user engagement, platform performance, and application usage patterns. As a Data Engineer, my role involves building a scalable data infrastructure that captures and processes both real-time and historical data generated within the VR environment.
Client Name: Maritzcx
Project : Customer Experience Platform.
Duration : May 2018 – Nov 2022
Technologies Used : SQL server, Data Lake (ADLS), Azure Key-Vault, Azure Data factory(ADF), Azure Databricks, PySpark, Power BI
Description: This CX platform is a customer feedback management system. With this application, customers can create the survey and send the invitations, so that the end user can take the survey, and feedback is collected. Platform allows customers to generate and schedule the various pre-defined reports based on the collected data. Some customers have custom requirements around the platform features. It needs to develop the custom applications to get that customers requirement like custom invitations, custom reports export and survey extensibility etc.
Project : Data Feeds
Duration : June 2014 – April 2018
Technologies Used : C#, ASP.NET, SQL, SSIS
Description: GroupM has developed a tool called Data Feeds. It's a collection of different feeds which we have developed to consume data from different vendors like (Facebook, Twitter, Google Analytics, and some of internal data within GroupM). And this data is then used by other tools of GroupM like Datamart and data marketplace. Each feed is a separate implementation and separate library.
Project : iSERAS (iProspect Search Engine Result Analysis System)
Duration : Jan 2013 - May 2014
Technologies Used : C#.net, MVC3.0 (razor), JQuery, SQL Server
Description: iSERAS is the Search engine result analysis system. This tool is used by the SEO analyst team that helps end clients to improve their ranking and set their goals for internet Advertising. This application consists of two main parts as a Web reporting UI & background data collection application. The data collector application collects ranking data from different sources like Google and Bing API. This application also collects PPC data from Google Adwords and MSN AdCenter API. The reporting UI part consists of a number of reports for Ranking and PPC data. These advanced reports (such as Pie charts, Bar charts, multiple axis graphs etc.) are generated using Telerik / Kendo Controls.
Project : IRC (iProspect Ranking Collector)
Duration : Jan 2010 – Dec 2012
Technologies Used : C#.net, MVC3.0 (razor), JQuery, SQL Server
Description: iRC system collects rankings from Google, Yahoo, Bing, AOL, Ask and YouTube search engines for various markets. The system records organic/sponsored rankings for given keywords against mentioned domains URL. This system also provides the Ranking report, Sponsored analysis report, competitors ranking report, non-defined competitor reports. This system collected top 30 rankings for given keywords for each client specified search engine. iRC system is divided into 3 main modules as Admin interface, Data Collector, UI reporting.