Uma Maheswari Badri

Bengaluru

Summary

Six years of industry experience, including three years as a Data Engineer specializing in Databricks and PySpark. Proficient in Azure services such as Azure Data Factory, Azure Data Lake Storage Gen2, and Azure Blob Storage. Demonstrated ability to design data pipelines and ETL processes, optimize large-scale data workflows, and maintain data integrity.

Overview

6
years of professional experience

Work History

Data Engineer

Capgemini
Bengaluru
08.2022 - Current

Client: Unilever

  • Designed and optimized ETL pipelines with Azure Data Factory and Databricks, minimizing reporting delays.
  • Automated data ingestion from APIs, databases, and cloud storage, improving accuracy and reducing manual effort.
  • Wrote and optimized complex SQL queries for improved data retrieval performance.
  • Implemented SCD Type 1 and SCD Type 2 in Dataflow for efficient database updates.
  • Created linked services, triggers, datasets, and pipelines, ensuring timely processing of all expected files.
  • Ensured data integrity through rigorous management of pipelines and scheduled runs.
  • Built ETL pipelines using PySpark and Databricks to read data from Azure Data Factory (ADF) and load it into Bronze and Silver Delta tables.
  • Applied Slowly Changing Dimensions (SCD) Type 2 to manage historical data between Bronze and Silver Delta tables.
  • Combined data from Silver tables and created Gold Delta tables using Spark SQL to match the business logic from on-premises data warehouse systems.
  • Replicated Gold Delta tables into Google BigQuery using PySpark in overwrite mode for seamless data access.
  • Built Airflow DAGs to automate and orchestrate data loading from Bronze to Silver tables.

Automation Test Engineer

Capgemini
Bengaluru
06.2019 - 07.2022

Client: Medica

  • Automated regression-suite test cases and built utilities for other areas suited to automation.
  • Gained hands-on experience with API, Excel, and process automation across web and desktop applications.
  • Developed new scripts, refactored existing code, and applied code fixes.
  • Executed automation scripts during upgrade releases.
  • Participated in Agile development activities.
  • Mentored new joiners and junior team members on their automation tasks.

Education

B.Tech - Computer Science

Sri Venkateswara College of Engineering
Tirupati
05.2019

Skills

  • Cloud platforms: Azure (ADLS Gen2, ADF)
  • Database management: Azure SQL Database
  • Programming languages: Python, PySpark, SQL, C#
  • Big data technologies: Databricks, Spark
  • ETL/ELT tools: Azure Data Factory, Databricks

Timeline

Data Engineer

Capgemini
08.2022 - Current

Automation Test Engineer

Capgemini
06.2019 - 07.2022

B.Tech - Computer Science

Sri Venkateswara College of Engineering