Senior Data Engineer experienced in Spark, Hadoop, Azure Synapse, ETL, data modeling, and data warehousing, with a proven track record of designing and implementing efficient data solutions. Strong analytical and problem-solving skills, backed by a deep understanding of database technologies and systems. Works well both independently and in teams, and communicates clearly.
• Architected and implemented ETL pipelines using Azure Databricks, ADLS, and Delta Lake for scalable big data processing and analytics.
• Enhanced performance and cost efficiency by optimizing Spark jobs, tuning cluster configurations, and reducing storage redundancy through Delta Lake optimizations.
• Integrated Azure Databricks with Synapse Analytics and other Azure services to deliver end-to-end data solutions, ensuring reliability and scalability for business-critical workflows.
Big Data Stack: Hadoop, HDFS, Apache Spark, Python, Scala, Kafka, Hive, Apache Beam
Cloud Stack: Databricks, Azure Synapse, Azure HDInsight, Azure Event Hubs, Logic Apps, GCP Dataflow, GCP Cloud Storage, GCP Pub/Sub, Docker
ETL, Informatica PowerCenter, Data Warehousing, Dimensional Modeling
Unix/Linux Scripting, WSL
DBMS: DB2 BLU, SQL Server, Oracle, Azure SQL, Cosmos DB
CI/CD, Git, Ansible, Azure DevOps, Airflow, Mage
Data Analysis