

Results-oriented data professional with extensive experience designing and delivering data solutions for enterprise clients. Skilled in Databricks Lakehouse, Apache Spark™, and Azure, with strong proficiency in Python, SQL, and Spark for building scalable, high-performance ETL pipelines. Proven success in migrating legacy systems to modern big data platforms, optimising data workflows, and translating complex business needs into robust technical architectures. Collaborative team player passionate about leveraging data to drive meaningful business outcomes.
Databricks
Python
SQL
Pyspark
Apache Spark
Apache Airflow
Kafka
Azure
MLOPS
Streaming
GitHub
Data Governance
ETL development
Data pipeline design
Data warehousing
Data modeling
Data migration
Machine learning
Migration from Informatica and Netezza to Databricks
Technologies: Informatica Workflow Manager, Databricks, Python, PySpark, SQL, Git
Migration from SQL Server to Delta Lake
Technologies: SSIS, Databricks, Python, PySpark, SQL, Git
Migration of Hive, Pig, and Shell Scripts from Hadoop to Databricks
Technologies: Databricks, Python, PySpark, SQL, HiveQL, Azure Repos
DBU Consumption Dashboard (Internal Use Case)
Technologies: Databricks AI/BI Dashboards
DPP (Delivery Partner Program) Process Automation (Internal Use Case)
Technologies: Airtable, Databricks AI/BI Dashboards
Databricks Certified Data Engineer Associate