Sampath Kumar Reddy Soma

Summary

Seasoned Data Engineer with a strong track record in designing and implementing scalable data pipelines on both Google Cloud Platform (GCP) and AWS. Expertise in Dataproc, BigQuery, Composer, AWS EMR, Glue, and Redshift. Adept at optimizing data processing workflows, ensuring data quality, and delivering actionable insights through effective visualization.

Work History

Data Engineer

Project: Telecom Network Data Aggregation and Analytics
  • Designed and implemented a batch processing pipeline on GCP for aggregating and analyzing network performance data.
  • Utilized Google Cloud Dataproc for batch processing and Google BigQuery for data warehousing.
  • Automated ETL workflows with Google Cloud Composer to ensure timely data processing and loading.
  • Developed interactive dashboards using Google Data Studio for real-time network performance monitoring and reporting.

Data Engineer

Project: AWS Batch Data Processing Pipeline
  • Developed a comprehensive batch processing pipeline using AWS services to manage and analyze large volumes of data.
  • Used AWS Glue to extract data from relational databases (RDBMS) and transform it.
  • Wrote the transformed data to Amazon S3 for durable and scalable storage.
  • Employed AWS Glue to load data from S3 into Amazon Redshift for further analysis.
  • Used Amazon EMR (Elastic MapReduce) for complex data processing and transformation tasks.
  • Performed data analysis and generated insights using SQL queries in Amazon Redshift, with results visualized through Amazon QuickSight.

Data Engineer

Project: GCP Batch Data Ingestion from Teradata
  • Designed a batch ingestion pipeline to move data from Teradata into BigQuery.
  • Extracted and processed the Teradata data with Google Cloud Dataproc.
  • Loaded the processed data into BigQuery’s staging area.
  • Implemented transformation logic in BigQuery’s transformed layer for final data processing.
  • Scheduled and managed the ETL workflow using Google Cloud Composer to ensure regular and automated data updates.

Education

B.Tech -

SASTRA DEEMED UNIVERSITY
Thanjavur, Tamil Nadu

Skills

  • Python
  • PySpark
  • SQL
  • Big Data
  • Data Modeling
  • Data Warehousing
  • Data Migration
  • GCP - BigQuery, Dataproc, Cloud Composer (Airflow), Cloud Functions, GCS
  • AWS - EMR, Glue, Redshift, Lambda, EventBridge
  • MongoDB
  • API Development
