Hi, I’m

Praveen Singh

Data Engineer
Ghaziabad, UP

Summary

Data Engineer with nearly 2 years of experience building and optimizing scalable data pipelines. Skilled in using Google Cloud Platform to automate workflows and to enhance product tagging and review summarization systems. Effective collaborator with cross-functional teams, consistently delivering high-quality data solutions that improve efficiency and align with business objectives.

Overview

2
years of professional experience

Work History

Badaa Data
Ghaziabad

Data Engineer
08.2023 - Current

Job overview

Project: Product Content and Intelligence (client: Wayfair)

  • Data Engineer with almost 2 years of experience building and maintaining scalable, automated data pipelines across cloud-native environments.
  • Automated product tagging pipelines using Google Cloud Pub/Sub, BigQuery, and Airflow (Cloud Composer), reducing manual intervention by 60%.
  • Improved product attribute accuracy (color, shape, material, subject, style, lifestage) by 30% by integrating ML inference into enrichment workflows.
  • Developed SQL-based enrichment processes, improving data preparation and reducing latency in ADS endpoint submissions.
  • Used Python, specifically pandas, to transform data and improve data quality.
  • Created unified schemas and validation logic across multiple Pub/Sub topics, ensuring consistent and reliable data exchange between systems.
  • Automated data uploads using Python, reducing operational overhead and increasing pipeline reliability.
  • Integrated Datadog dashboards and alerting to monitor performance metrics such as job failures, latencies, row-level updates, and endpoint success rates.
  • Enabled proactive issue resolution by integrating PagerDuty with Airflow for real-time alerts via email, SMS, and phone.
  • Collaborated with the ML and UFS teams to deliver production-ready data workflows with clear schema documentation and lifecycle management.

Project: Review Summarization System (client: Wayfair)

  • Collaborated with the UFS teams to design and implement a Gen AI review summarization system; defined and documented the schemas for four Pub/Sub topics, including aspect extraction and review summarization inputs and outputs, ensuring clarity in data exchange workflows.
  • Developed a review summarization pipeline that processes live review events, extracts relevant aspects, and generates concise summaries for SKU reviews based on defined thresholds.
  • Established data constraints and validation rules, including thresholds for minimum review counts and incremental updates, to maintain data integrity and optimize the summarization process, improving processing efficiency by 40%.

Tellusys Info Private Limited
Gurugram

Technical Support Engineer
07.2022 - 07.2023

Job overview

  • Assisted customers with the setup and configuration of VoIP systems, including SIP trunks, IP phones, and PBX systems, and promptly resolved technical issues related to call quality, connectivity, and hardware/software problems.

Education

Arya College of Engineering And IT
Jaipur

from Electronics And Communications Engineering
07-2021

National High School
Kolkata

12th from Science
06-2016

Andhra Association High School
Kolkata

10th
05-2014

Skills

  • Python
  • SQL
  • Pandas
  • PySpark
  • Airflow
  • GCP (BigQuery, GCS, Cloud Composer, Pub/Sub, Dataproc, Cloud Functions, Cloud SQL/Spanner, Apache Beam, Vertex AI)
  • Snowflake
  • DSA (linked list, insertion sort, merge sort)
  • ETL
  • Prompt engineering

Affiliations

  • Cricket
  • Football
  • Kabaddi

Personal Traits

  • Excels in both autonomous roles and team environments, consistently driving results while fostering collaboration and mutual success.
  • Willing to learn, perform, and take on responsibilities and challenges.
  • A smart negotiator with the ability to initiate cost-effective arrangements.

Accomplishments

  • Secured 1st position in 12th board exams
  • Received Employee of the Month and appreciation token awards four times in the past two years

Timeline

Data Engineer

Badaa Data
08.2023 - Current

Technical Support Engineer

Tellusys Info Private Limited
07.2022 - 07.2023

Arya College of Engineering And IT

from Electronics And Communications Engineering

National High School

12th from Science

Andhra Association High School

10th