SURYA TEJA NAVUDURU

Chennai

Summary

Detail-oriented, results-driven Data Engineer with over 7 years of experience designing and implementing large-scale data pipelines using PySpark, Azure Data Factory, and Databricks. Proven track record in migrating complex data systems and optimizing ETL workflows. Skilled in SQL, Delta Lake, and data governance frameworks. Strong communicator with a focus on collaboration, automation, and data-driven decision-making.

Overview

8 years of professional experience
1 Certification

Work History

Data Engineer - PySpark Developer

Cognizant Technology Solutions
12.2021 - Current
  • Migrated healthcare compliance data from Cloudera and Informatica to the Azure and AWS cloud platforms, ensuring adherence to market-specific reporting standards.
  • Developed scalable ETL pipelines using PySpark, Spark SQL, and Azure Data Factory for efficient data ingestion, transformation, and storage.
  • Transitioned complex Domo and MIDAS dataflows to Databricks Delta Lake, improving data refresh time by 30%.
  • Integrated Power BI for real-time reporting, enhancing business intelligence and stakeholder visibility.
  • Applied Unity Catalog for data governance and fine-grained access control across workspaces.
  • Automated workflows using Apache Airflow to minimize manual intervention and improve reliability.

Data Engineer

HCL Technologies
10.2017 - 11.2021
  • Designed and maintained batch ETL pipelines using Hive, PySpark, and Sqoop to support a multi-country financial services platform.
  • Developed liquidity management reporting solutions including LCR and NSFR metrics.
  • Conducted performance tuning for large-scale queries, improving execution speed by 40%.
  • Ensured high data quality and compliance with financial regulations through rigorous validation processes.

Education

Bachelor of Technology - Power Engineering

GMR Institute of Technology
04-2016

Skills

  • Programming: Python, SQL, PySpark
  • Big Data: Apache Spark, Hive, HDFS, Delta Lake
  • Cloud Platforms: Microsoft Azure, AWS
  • Tools & Technologies: Azure Data Factory, Databricks, Power BI, Sqoop, Airflow, ADLS
  • Databases: Azure SQL Database, MySQL
  • Concepts: ETL Development, Data Modeling, Data Warehousing, Data Governance, CI/CD for Data Pipelines

Certification

  • Programming in Python (Meta), 2023-03
  • Introduction to Data Analyst (IBM), 2023-03
  • Foundations: Data, Data, Everywhere (Google), 2023-03

Languages

English, Telugu, Hindi
