Summary
Overview
Work History
Education
Skills
Websites
Accomplishments
Hobbies and Interests
Languages
Timeline
Generic

Sai KUMAR Peesa

Visakhapatnam

Summary

Data Engineer with over 3 years of experience in designing and implementing scalable data pipelines and real-time processing systems. Expertise in PySpark, AWS, Databricks, Delta Lake, Iceberg, and Airflow. Proven ability to optimize workflows and reduce costs while delivering impactful analytics solutions. Holds an M.Tech. In Computer Science from IIT (ISM) Dhanbad.

Overview

3
3
years of professional experience

Work History

Data Engineer

Freo
Bangalore
11.2023 - Current
  • Automated Redshift table operations (creation, inserts, updates) using AWS Lambda and Slack integration.
  • Developed a table flattening process for Delta and Iceberg tables with over 2000 columns from CIBIL and CRIF data using PySpark on EMR.
  • Created real-time data pipelines employing Databricks with PySpark Streaming and Kinesis Streams for SMS data and app events.
  • Reduced manual SMS tagging efforts by 60% through development of a real-time tagging application in Databricks.
  • Established an SDK log table with Kinesis Streams and AWS Lambda, optimizing performance through partitioning.
  • Deployed CloudWatch monitoring for EC2 instances, achieving 99% uptime for critical data pipelines.
  • Implemented real-time performance alerts for Airflow and Superset, leading to a 15-minute average incident resolution time.
  • Streamlined data science workflows, enhancing efficiency and decreasing processing times by up to 80%.

Data Engineer

Niyo Solutions Inc
Bangalore
06.2022 - 11.2023
  • Developed ETL workflows using AWS Glue for Delta table and external table creation in Athena and Redshift.
  • Led migration from traditional Data Warehouse to Data Lake on AWS, utilizing S3 and EMR, achieving 60-70% cost reduction.
  • Migrated critical ETL workflows from AWS Glue to EMR, leveraging PySpark and Airflow, resulting in 50% cost savings.
  • Implemented real-time data processing with AWS Lambda and SQS, establishing an event-driven architecture for efficient message handling.
  • Automated Airflow DAG creation using Python scripts for data ingestion and transformation, enhancing workflow reliability.
  • Collaborated with tech and product teams to design reports and dashboards in Superset and QuickSight for actionable insights.

Education

M.Tech - Computer Science (Spl. In Information Security)

IIT(ISM)
05.2022

B.Tech - Computer Science

CVR College of Engineering
04.2019

Skills

    Data pipeline development and ETL processes

  • Real-time analytics and big data processing
  • Data lake architecture
  • AWS services and Python programming
  • Data visualization and reporting
  • Analytical thinking and problem solving

Accomplishments

  • 5-star rated on HackerRank with 500+ problems solved across LeetCode, HackerRank, and GeeksforGeeks.

Hobbies and Interests

  • Traveling and exploring new places to refresh ideas and creativity.
  • Reading mythological books and watching stories to explore cultural narratives and ancient wisdom.

Languages

  • English
  • Hindi
  • Telugu

Timeline

Data Engineer

Freo
11.2023 - Current

Data Engineer

Niyo Solutions Inc
06.2022 - 11.2023

M.Tech - Computer Science (Spl. In Information Security)

IIT(ISM)

B.Tech - Computer Science

CVR College of Engineering
Sai KUMAR Peesa