

Data Engineer with a strong software engineering background and a Master’s degree in Data Analytics. Experienced in building scalable data pipelines, lakehouse architectures, and analytics-ready datasets using SQL, Python, Apache Spark, and AWS. Skilled in batch and near real-time data processing, data modeling, and performance optimization. Adept at delivering reliable data platforms that support analytics and business decision-making.
Data Engineering: ETL/ELT Pipelines, Data Pipelines, Lakehouse (Bronze/Silver/Gold), Batch Processing
Big Data: Apache Spark, PySpark, Spark SQL
Programming: SQL (Advanced), Python, JAVA
Cloud (AWS): S3, Glue, Athena, Lambda, SQS
Data Architecture: Data Modeling, Schema Enforcement, Partitioning, Optimization
Tools: Git, GitHub, Airflow (basic), Docker (basic)