
SHIVANI KUMARI

Bangalore

Summary

Data Engineer with 3.2 years of experience in building scalable, data-intensive applications across diverse industries. Proficient in data ingestion, storage, transformation, and visualization. Skilled in developing and deploying adaptive services that translate business requirements into effective solutions.

Overview

4 years of professional experience

Work History

Data Engineer

DataSturdy Consulting Private Limited
10.2021 - Current
  • Client: Myntra Pvt Ltd

Project I: Azure Data Warehouse Administrator

  • Designed and managed Myntra's large-scale Data Warehouse systems (18,000 TB, 6,000–10,000 DWU), optimizing data models by applying appropriate distribution keys and load strategies for over 5,000 tables.
  • Developed complex data processing procedures and data replication pipelines using Azure Data Factory, ensuring efficient data movement and transformation.
  • Enhanced SQL query performance, designed scalable data warehouse architectures, and implemented effective workload management strategies for resource allocation and concurrency, along with setting threshold alerts for monitoring and optimization.
  • Optimized the database by correcting indexes, updating statistics, aligning tables with appropriate distribution strategies, and performing code optimizations, resulting in annual cost savings of 33 lakhs and reducing DWU usage to 5,000.


Project II: Data Warehouse to Delta Table (Databricks) Migration

  • Migrated over 30 complex stored procedures from Azure Data Warehouse to Databricks by converting T-SQL scripts into PySpark and Spark SQL, and creating Databricks jobs for execution.
  • Efficiently scheduled and orchestrated the jobs using Apache Airflow, ensuring streamlined workflows and timely execution.
  • Designed and implemented robust Data Quality Checks to ensure the accuracy, consistency, and reliability of the migrated data.
  • Monitored and maintained the aggregated Delta Tables to guarantee consistent performance, scalability, and data availability.


Project III: Offline Reporting

  • Analyzed business processes for Offline Reporting (B2B & B2C) to identify the key data sources and data attributes needed for end-to-end data pipelines.
  • Onboarded 10+ vendors and integrated data from 20+ reports via SFTP to Bifrost/Trino in Delta format.
  • Developed and automated ingestion pipelines using PySpark on Databricks and Apache Airflow, and created SQL-based aggregations for business analysis.
  • Built over 25 Databricks jobs to synchronize aggregated tables on a daily basis.


Project IV: Power BI Development and Administration

  • Consumed data from multiple source systems, including MemSQL, Azure Data Warehouse, and Delta Tables (Databricks), to build over 20 reports using both DirectQuery and Import mode.
  • Administered user accounts, roles, and permissions to ensure secure access to Power BI resources while maintaining compliance with business requirements.
  • Configured and maintained Power BI Premium P1 capacity, workspaces, reports, and data sources.
  • Built an end-to-end Power BI system by setting up authentication through Personal Access Tokens and Service Principal.
  • Monitored and optimized report performance, ensured data governance practices, and maintained data quality and compliance.

Data Engineer

DataSturdy Consulting Private Limited
10.2021 - 06.2022
  • Client: OLA Electric

OLA Telematics (Azure Data Explorer & Power BI)

  • Spearheaded the end-to-end implementation of OLA Telematics, ensuring seamless data ingestion, storage, and advanced analytics.
  • Engineered real-time monitoring systems for vehicle parameters using Azure Data Explorer and designed time series analysis workflows for historical performance review.
  • Developed and optimized rule-based alert systems (e.g., theft, crash, battery temperature) utilizing fast event processing in a streaming data environment.
  • Executed data transformations, direct queries, and JSON parsing, culminating in robust Power BI dashboards and real-time alerts configured through Azure Logic Apps.

Education

Master of Computer Application

Dr. Ambedkar Institute of Technology
01.2021

Bachelor of Computer Application

Vivekananda Institute of Management
01.2019

Skills

  • Microsoft SQL Server
  • Azure Data Warehouse (Azure Synapse)
  • OLTP & OLAP
  • Azure Databricks
  • Spark (PySpark)
  • Python
  • SQL
  • Azure Data Explorer (KQL)
  • Power BI (Data Modeling & Data Validation)
  • Microsoft Fabric
  • ADLS Gen2
  • Azure Data Factory (ADF)
  • Azure Logic Apps
  • Azure Event Hubs
  • Service Principal
  • Airflow
  • JIRA
  • Git
  • MS Office
