
Aspiring Data Engineer with strong foundations in big data technologies, cloud platforms, and data warehousing. Hands-on experience in building scalable data pipelines using Apache Spark, Hadoop, and Databricks. Skilled in SQL and Python with knowledge of ETL/ELT processes, data governance, and cloud-based architectures on Azure. Passionate about designing efficient, secure, and cost-effective data solutions to support analytics and business decision-making.
Relevant Work Exp ( 2025- Present )
Cloud-Based Data Pipeline using Databricks & Spark
Enterprise Data Warehouse Design
Big Data Processing with Hadoop Ecosystem
Data Governance & Metadata Management (Databricks Unity Catalog)
Programming Language:
Python (OOP Concepts), SQL, Advanced SQL, Linux (Basics)
Big Data & Processing:
Apache Spark (RDDs, Data Frames, Spark SQL, Optimization, Internals), Hadoop, HDFS, Apache Hive
Data Engineering:
ETL/ELT Pipelines, Data Ingestion Patterns, Data Modelling, Data Governance, Metadata Management
Cloud & Platforms:
Databricks, Unity Catalog, Azure Fundamentals
Data Warehousing & System Design :
OLTP, OLAP, Data Lake, Data Warehouse, Fact & Dimension Tables, Star Schema, Snowflake Schema, MicroService Architecture, SQL query optimization