Summary
Overview
Work History
Education
Skills
Accomplishments
Timeline
Generic

RAMAN GANGWANI

New Delhi

Summary

Tech professional with 2.7+ years of experience and hands-on experience in designing, developing, and maintaining scalable data pipelines and architecture. Adept at using tools like SQL, Python, and cloud platforms to optimize data workflows and ensure data integrity. Strong problem-solving skills combined with collaborative approach to drive data solutions that support business objectives.

Overview

2
2
years of professional experience

Work History

Data Engineer

Vaco Binary Semantics
06.2024 - Current
  • Developed SQL scripts to perform comparative analysis between legacy and updated data feeds, including the calculation of variances.
  • Designed and managed data ingestion pipelines for integrating data of four-wheelers across different locales onto Google's Knowledge Graph, enhancing data structuring and search capabilities.
  • Applied PySpark and Python scripts to clean, filter, and join large datasets, leveraging SQL queries for data transformation and validation.
  • Improved the ETL workstreams by adding string data quality checks and schema validation, resulting in higher data accuracy.
  • Scoped client requirements and built Python-based ETL pipelines to ingest multi-locale drug data into Google’s Knowledge Graph, enhancing data structure, and insights.
  • Worked on Google Search-related projects, optimizing data workflows to boost search operations and maintain high data quality standards.
  • Developed interactive dashboards and reports using Power BI, enabling stakeholders to monitor data ingestion performance, identify anomalies, and derive actionable insights.

Data Engineer

Badaadata
05.2023 - 06.2024
  • Built and scheduled SQL/BigQuery jobs for data analysis; monitored and resolved failures using Kronos.
  • Converted Python scripts to BigQuery SQL, with client-specific transformations and data validation.
  • Used advanced SQL and Pandas for data wrangling, improving data quality, and performance.
  • Handled relational and NoSQL databases, ensuring data integrity, and security.
  • Worked in a data-driven organisation delivering advanced analytical solutions.
  • Crafted custom ETL Extract, Transform, Load processes tailored to specific project needs, enhancing data usability.

Education

Bachelor's of Computer Application - Computer science & Engineering

Fairfield Institute of Management and Technology GGSIPU
05.2022

Skills

  • PySpark
  • Apache Spark
  • Azure Data Factory
  • ETL development
  • Google BigQuery
  • Microsoft SQL Server
  • Python
  • Data warehousing
  • SQL expertise
  • Power BI
  • Data modeling

Accomplishments

  • SQL Efficiency Improvement, Reduced data processing time by 40% through optimized SQL script design.
  • ETL Pipeline Management, Managed ETL pipelines ingesting 500GB data daily with 99.8% accuracy.
  • Data Validation Enhancement, Increased accuracy of data validation processes by 25% using improved Python scripts.
  • Data Visualization Success, Visualized complex datasets, improving stakeholder understanding by 60% using Tableau.

Timeline

Data Engineer

Vaco Binary Semantics
06.2024 - Current

Data Engineer

Badaadata
05.2023 - 06.2024

Bachelor's of Computer Application - Computer science & Engineering

Fairfield Institute of Management and Technology GGSIPU
RAMAN GANGWANI