Summary
Overview
Work History
Education
Skills
Projects
Certification
Timeline
AssistantManager
Anthaiah Gangarapu

Anthaiah Gangarapu

Data Engineer
Hyderabad

Summary

Results-driven Data Analyst with internship experience at Revature, skilled in Python, BigQuery, predictive modeling, trend analysis, and data visualization. Experienced in developing data-driven solutions to improve inventory management and forecasting accuracy. Proven ability to analyze complex datasets and deliver actionable insights that support strategic decision-making. Strong team collaborator focused on performance improvement and business impact.

Overview

3
3
Certifications

Work History

Data Analytics

Revature
Hyderabad
04.2025 - 06.2025
  • Applied advanced data analytics techniques to identify patterns and trends in system performance, leading to targeted performance improvements.
  • Developed predictive models to enhance inventory management strategies, improving forecasting accuracy and reducing stock-related inefficiencies.
  • Conducted trend analysis using analytics tools to support strategic decision-making across operations and finance departments.
  • Contributed to end-to-end data workflows, including ingestion, transformation, and analysis using Python, SQL, Hadoop, Hive, PySpark, and BigQuery.
  • Analyzed YouTube and Spotify datasets to uncover music trends, user behavior, and content performance insights.
  • Leveraged data analytics tools to gain insights into system usage patterns, identifying opportunities for resource optimization and cost reduction.

Education

Master of Computer Applications [MCA] - CSE

KL University
Vijayawada
04.2001 -

Bachelor of Science - Computer Science

Vignan Degree & PG College
Guntur
04.2001 -

Skills

Cloud Platforms & Services:

Networking Concepts:

Software Development Life Cycle (SDLC), Agile Methodology

Development & Methodologies:

Git, GitHub

Version Control:

MySQL, Hive, SQL, BigQuery

Databases & Query Languages:

Hadoop Distributed File System (HDFS), Apache Hive, Apache Beam, PySpark

Big Data & Processing Technologies:

Jupyter Notebook, Google Colab, Data Analysis, Data Integration, Dashboard Design, Data Visualization, Trend Analysis

Data Analytics & Visualization Tools:

Python, PySpark

Programming Languages:

Google Cloud Platform (GCP), Cloud Storage, Cloud Composer, Pub/Sub, Apache Beam, BigQuery

Cloud Platforms & Services:

Projects

YouTube Data Analysis using Python & MySQL

  • Cleaned and analyzed the YouTube 2025 dataset to uncover trends in video length, content type, engagement, and subscriber growth.
  • Migrated data into MySQL and used complex queries to extract business insights.
  • Visualized key metrics using Python libraries for stakeholder reporting.

Tools: Python, MySQL, Pandas, Matplotlib, Git

Spotify Songs Data Analysis – Phase 1

  • Processed large-scale Spotify data to analyze genre popularity and audio features like tempo and energy.
  • Applied PySpark and Hive on Hadoop for scalable data transformation and used BigQuery for querying.
  • Delivered visual reports to guide strategy for music platforms and artists.

Tools: Python, MySQL, Pandas, Matplotlib, Git

Serverless Orchestrated ETL Pipeline – Phase 2 (Advanced)

  • Built a fully automated ETL pipeline on GCP to ingest, transform, and load Spotify data into BigQuery.
  • Used Cloud Composer (Airflow) to orchestrate workflows triggered by file uploads to GCS.
  • Implemented Apache Beam (Dataflow) jobs for transformation and real-time data integration.
  • Improved pipeline scalability, automation, and data freshness.

Tools: GCP (GCS, Cloud Composer, Pub/Sub, Dataflow), BigQuery, Python

Certification

Cisco Network Academy

Timeline

Data Analytics

Revature
04.2025 - 06.2025

Master of Computer Applications [MCA] - CSE

KL University
04.2001 -

Bachelor of Science - Computer Science

Vignan Degree & PG College
04.2001 -
Anthaiah GangarapuData Engineer