Iswarya Lakshmi Potnuru

Srikakulam

Summary

Data Engineer with expertise in SQL and PySpark, focused on optimizing ETL processes and enhancing data warehousing solutions. Demonstrated success in automating data pipelines and improving reporting efficiency. Strong analytical skills contribute to excellence in data management.

Overview

1 year of professional experience
1 Certification

Work History

Data Engineer

Syren Cloud
07.2024 - Current
  • Contributed to the Engineering and Property Services (E&Ps) domain by implementing additional data warehousing requirements using SQL, optimizing ETL pipelines, and improving reporting efficiency.
  • Designed and implemented efficient data warehousing solutions using SQL, including DDL, resulting in faster data retrieval.
  • Optimized existing data pipelines using PySpark, reducing processing latency for critical dashboards.

Data Engineering Intern

Syren Cloud Company
02.2024 - 07.2024
  • Gained hands-on experience with PySpark, Azure Cloud, and data lake architecture for scalable ETL processes.
  • Developed and implemented data modeling techniques for a big data project, enhancing data organization and accessibility.
  • Automated data transformation pipelines and scripted solutions using Python.

Education

B.Tech - Information Technology

Anil Neerukonda Institute of Technology And Sciences (ANITS)
Visakhapatnam
05.2024

Skills

  • C programming and Java
  • Bash scripting
  • Apache Spark and PySpark
  • Hadoop ecosystem
  • Hive querying
  • Azure Data Factory
  • Azure Blob Storage
  • Azure Databricks
  • Azure Synapse Analytics
  • Python for data analysis
  • Data modeling techniques
  • ETL and ELT processes
  • SQL command proficiency
  • Bronze-Silver-Gold architecture design
  • Git version control (basic)
  • Data warehousing principles

Certification

  • Microsoft Certified: Azure Data Fundamentals (DP-900)
  • Databricks Data Engineer Associate
  • Databricks Fundamentals of Lakehouse Platform

Projects

Olympic data analytics

  • Designed and developed a complete data analytics pipeline for Olympic historical data using the Azure data engineering stack
  • Ingested data using Azure Data Factory into Azure Data Lake Gen2, processed with PySpark in Databricks, and implemented Bronze, Silver, Gold architecture
  • Loaded cleaned data into Azure Synapse Analytics and visualized results using Power BI
  • Implemented basic orchestration and monitoring using Airflow, improving reliability and observability of data flows

IPL data analysis using Azure Databricks

  • Built an ETL pipeline to analyze IPL match and player data using PySpark in Azure Databricks, with all data stored and processed in DBFS
  • Generated insights such as top players and team performance trends using Spark SQL and Python; visualized the analysis with Matplotlib, Seaborn, and custom Python scripts
  • Focused on optimization through data partitioning and caching

Accomplishments

  • Solved 150+ coding challenges on HackerRank; earned Gold Badge in SQL & Python
  • Ranked 50th on the college leaderboard on GeeksforGeeks
  • Placed in the top 15% on the LinkedIn SQL skill assessment
