Gaurav Singh - Data Engineer - SITA

Summary

Aspiring Data Engineer with strong foundations in big data technologies, cloud platforms, and data warehousing. Hands-on experience in building scalable data pipelines using Apache Spark, Hadoop, and Databricks. Skilled in SQL and Python with knowledge of ETL/ELT processes, data governance, and cloud-based architectures on Azure. Passionate about designing efficient, secure, and cost-effective data solutions to support analytics and business decision-making.

Overview

4

years of professional experience

Work History

Data Engineer

SITA

New Delhi

08.2022 - Current

Relevant Work Exp ( 2025- Present )

Cloud-Based Data Pipeline using Databricks & Spark

Designed and implemented scalable data pipelines using Databricks and Apache Spark.
Built reusable data ingestion patterns for structured and semi-structured data.
Processed large datasets using Spark DataFrames and Spark SQL.
Optimized performance through partitioning, caching, and query tuning.

Enterprise Data Warehouse Design

Designed dimensional data models using Star and Snowflake schemas.
Created fact and dimension tables for analytical workloads.
Implemented ETL workflows using SQL and Apache Hive.
Ensured data consistency and quality through validation checks.

Big Data Processing with Hadoop Ecosystem

Managed distributed storage using HDFS.
Queried and transformed data using Apache Hive.
Built batch processing workflows for large-scale datasets.

Data Governance & Metadata Management (Databricks Unity Catalog)

Applied data governance principles using Unity Catalog.
Managed metadata and ensured compliance with data standards.
Improved data discoverability and documentation.

Education

B. Tech - Electronics And Communication Engineering

AMITY SCHOOL OF ENGINEERING AND TECHNOLOGY

New Delhi

06-2018

Skills

Programming Language:

Python (OOP Concepts), SQL, Advanced SQL, Linux (Basics)

Big Data & Processing:
Apache Spark (RDDs, Data Frames, Spark SQL, Optimization, Internals), Hadoop, HDFS, Apache Hive

Data Engineering:
ETL/ELT Pipelines, Data Ingestion Patterns, Data Modelling, Data Governance, Metadata Management

Cloud & Platforms:
Databricks, Unity Catalog, Azure Fundamentals

Data Warehousing & System Design :
OLTP, OLAP, Data Lake, Data Warehouse, Fact & Dimension Tables, Star Schema, Snowflake Schema, MicroService Architecture, SQL query optimization

Timeline

Data Engineer

SITA

08.2022 - Current

B. Tech - Electronics And Communication Engineering

AMITY SCHOOL OF ENGINEERING AND TECHNOLOGY