Summary
Overview
Work History
Education
Skills
Timeline
Generic

Gaurav Singh

New Delhi

Summary

Aspiring Data Engineer with strong foundations in big data technologies, cloud platforms, and data warehousing. Hands-on experience in building scalable data pipelines using Apache Spark, Hadoop, and Databricks. Skilled in SQL and Python with knowledge of ETL/ELT processes, data governance, and cloud-based architectures on Azure. Passionate about designing efficient, secure, and cost-effective data solutions to support analytics and business decision-making.

Overview

4
4
years of professional experience

Work History

Data Engineer

SITA
New Delhi
08.2022 - Current

Relevant Work Exp ( 2025- Present )

Cloud-Based Data Pipeline using Databricks & Spark

  • Designed and implemented scalable data pipelines using Databricks and Apache Spark.
  • Built reusable data ingestion patterns for structured and semi-structured data.
  • Processed large datasets using Spark DataFrames and Spark SQL.
  • Optimized performance through partitioning, caching, and query tuning.

Enterprise Data Warehouse Design

  • Designed dimensional data models using Star and Snowflake schemas.
  • Created fact and dimension tables for analytical workloads.
  • Implemented ETL workflows using SQL and Apache Hive.
  • Ensured data consistency and quality through validation checks.

Big Data Processing with Hadoop Ecosystem

  • Managed distributed storage using HDFS.
  • Queried and transformed data using Apache Hive.
  • Built batch processing workflows for large-scale datasets.

Data Governance & Metadata Management (Databricks Unity Catalog)

  • Applied data governance principles using Unity Catalog.
  • Managed metadata and ensured compliance with data standards.
  • Improved data discoverability and documentation.

Education

B. Tech - Electronics And Communication Engineering

AMITY SCHOOL OF ENGINEERING AND TECHNOLOGY
New Delhi
06-2018

Skills

Programming Language:

Python (OOP Concepts), SQL, Advanced SQL, Linux (Basics)

Big Data & Processing:
Apache Spark (RDDs, Data Frames, Spark SQL, Optimization, Internals), Hadoop, HDFS, Apache Hive

Data Engineering:
ETL/ELT Pipelines, Data Ingestion Patterns, Data Modelling, Data Governance, Metadata Management

Cloud & Platforms:
Databricks, Unity Catalog, Azure Fundamentals

Data Warehousing & System Design :
OLTP, OLAP, Data Lake, Data Warehouse, Fact & Dimension Tables, Star Schema, Snowflake Schema, MicroService Architecture, SQL query optimization

Timeline

Data Engineer

SITA
08.2022 - Current

B. Tech - Electronics And Communication Engineering

AMITY SCHOOL OF ENGINEERING AND TECHNOLOGY
Gaurav Singh