Seasoned Data Engineer with a strong track record in designing and implementing scalable data pipelines on both Google Cloud Platform (GCP) and AWS. Expertise in Dataproc, BigQuery, Composer, AWS EMR, Glue, and Redshift. Adept at optimizing data processing workflows, ensuring data quality, and delivering actionable insights through effective visualization.
Work History
Data Engineer
Project: Telecom Network Data Aggregation and Analytics
Designed and implemented a batch processing pipeline on GCP for aggregating and analyzing network performance data.
Utilized Google Cloud Dataproc for batch processing and Google BigQuery for data warehousing.
Automated ETL workflows with Google Cloud Composer to ensure timely data processing and loading.
Developed interactive dashboards using Google Data Studio for real-time network performance monitoring and reporting.
Data Engineer
Project: AWS Batch Data Processing Pipeline
Developed a comprehensive batch processing pipeline using AWS services to manage and analyze large volumes of data.
Used AWS Glue to extract data from relational databases (RDBMS) and transform it.
Wrote the transformed data to Amazon S3 for durable and scalable storage.
Employed AWS Glue to load data from S3 into Amazon Redshift for further analysis.
Utilized AWS EMR (Elastic MapReduce) for complex data processing tasks and transformations.
Performed data analysis and generated insights using SQL queries in Amazon Redshift, with results visualized through Amazon QuickSight.
Data Engineer
Project: GCP Batch Data Ingestion from Teradata
Designed a batch ingestion pipeline to transfer data from Teradata to BigQuery, processing the data using Dataproc.
Extracted data from Teradata using Google Cloud Dataproc for processing.
Loaded the processed data into BigQuery’s staging area.
Implemented transformation logic in BigQuery’s transformed layer for final data processing.
Scheduled and managed the ETL workflow using Google Cloud Composer to ensure regular and automated data updates.
Director of Project Management – Director of Operations & Data Analytics at Director of Project Management – Director of Operations & Data AnalyticsDirector of Project Management – Director of Operations & Data Analytics at Director of Project Management – Director of Operations & Data Analytics