Results-oriented data professional with extensive experience designing and delivering data solutions for enterprise clients. Skilled in Databricks Lakehouse, Apache Spark™, and Azure, with strong proficiency in Python, SQL, and Spark for building scalable, high-performance ETL pipelines. Proven success in migrating legacy systems to modern big data platforms, optimising data workflows, and translating complex business needs into robust technical architectures. Collaborative team player passionate about leveraging data to drive meaningful business outcomes.
Databricks
Migration from Informatica and Netezza to Databricks
Technologies: Informatica Workflow Manager, Databricks, Python, PySpark, SQL, Git
Migration from SQL Server to Delta Lake
Technologies: SSIS, Databricks, Python, PySpark, SQL, Git
Migration of Hive, Pig, and Shell Scripts from Hadoop to Databricks
Technologies: Databricks, Python, PySpark, SQL, HiveQL, Azure Repos
DBU Consumption Dashboard (Internal Use Case)
Technologies: Databricks AI/BI Dashboards
DPP (Delivery Partner Program) Process Automation (Internal Use Case)
Technologies: Airtable, Databricks AI/BI Dashboards
Databricks Certified Data Engineer Associate