Summary
Overview
Work History
Education
Skills
Certification
Awards Recognition
Timeline
Generic

SAI KUMAR GONUGUNTA

Summary

Results-driven Data Engineer with 3.3+ years of experience in building scalable ETL pipelines, real-time data flows, and analytics-ready datasets using Azure, PySpark, and Delta Lake. Proven expertise in delivering high-performance data solutions that drive product insights and business decisions.

Overview

3
3
years of professional experience
1
1
Certification

Work History

Data Engineer

Tata Consultancy Services
12.2021 - Current
  • Designed and developed scalable ETL pipelines using PySpark in Databricks, improving data availability for product analytics by 30%
  • Migrated on-premises data warehouse to Azure with Azure Data Factory and Databricks, achieving 99.9% uptime and enhancing data processing speed by 25%
  • Optimized SQL-based transformations in Databricks and Azure Synapse, reducing query execution time by 30% and accelerating data retrieval for key business dashboards
  • Implemented an orchestration framework in Apache Airflow, automating end-to-end pipeline execution and reducing manual intervention by 40%
  • Engineered a proactive data framework that detected and resolved pipeline failures, reducing data downtime by 40%
  • Integrated multiple data sources (databases, APIs, event streams) into Azure, creating a unified data platform for business intelligence, reporting, and predictive analytics
  • Enhanced data storage efficiency using Delta Lake in Databricks, ensuring ACID transactions, data versioning, and optimized query performance
  • Developed real-time monitoring dashboards in Azure, enabling self-service analytics and reducing incident resolution time by 40%
  • Collaborated with engineering teams to design scalable data models that improved query efficiency and supported product feature development

Education

Bachelor of Technology - Electronics and Communication Engineering

QIS Institute of Technology (2017-2021)
Ongole

Skills

  • ETL & Data Transformation: PySpark, SQL, Azure Data Factory, Databricks
  • Cloud & Storage: Azure Data Lake Storage Gen2, Azure Synapse, Blob Storage, Key Vault
  • Data Architecture: Delta Lake, Medallion Architecture, Lakehouse Design
  • DevOps & CI/CD: Git, Azure DevOps, CI/CD Pipeline Integration
  • Orchestration & Monitoring: Apache Airflow, ADF Triggers, Azure Monitor
  • Analytics & BI: Power BI
  • Data Quality & Governance: Data Validation, Error Handling, Unity Catalog
  • Data Modeling: Dimensional Modeling, Star & Snowflake Schema Design

Certification

  • DP-203 – Data Engineering on Microsoft Azure
  • AZ-900 – Microsoft Azure Fundamentals
  • AI-900 – Microsoft Azure AI Fundamentals
  • PL-300 – Microsoft Power BI Data Analyst

Awards Recognition

  • Star of the Month – Recognized for driving improvements in data pipeline efficiency and reliability
  • Best Performance Award – Acknowledged for automating ETL workflows and optimizing performance in Databricks
  • Beyond Excellence Award – Appreciated for implementing practical Azure + PySpark solutions that improved data delivery and reduced processing time

Timeline

Data Engineer

Tata Consultancy Services
12.2021 - Current

Bachelor of Technology - Electronics and Communication Engineering

QIS Institute of Technology (2017-2021)
SAI KUMAR GONUGUNTA