
Senior Data Engineering Manager with 12+ years of expertise, specializing in architecting high-impact data platforms and leading large-scale teams. I led the design and implementation of one of Asia's largest and most scalable Data Lakes, transforming raw data into critical business assets. I am a hands-on leader skilled in building and mentoring high-performing engineering teams from the ground up, ensuring operational excellence and continuous innovation. Currently, I am driving cloud-native data strategy across AWS, GCP, and Azure (including hybrid Cloudera migration) while pioneering the integration of AI/ML and Data Observability platforms. I turn data strategy into cost-efficient, production-ready solutions
Cloud Platforms & Data Ecosystems (Strategic and Server-less)
AWS: EMR(EC2/EKS), Redshift, S3, Glue, Kinesis, MSK, Lambda, Cloudwatch
GCP: BigQuery, Dataflow, DataProc, Pub/Sub, Vertex AI, GCS, Cloud Composer
Azure: Synapse, Databricks, Data Factory, Event Hub
Lakehouse/Warehouse: Databricks, Snowflake, Delta Lake, Iceberg
On Premise: Apache Ecosystem with CDP and HDP
Data Architecture & Modelling (Foundational Design)
Architecture Patterns: Data Mesh, Data Fabric, Data Lakehouse, Data Warehousing
Data Modeling: Dimensional Modeling (Star/Snowflake), Data Vault, Conceptual, Logical and Physical Data Modelling
Database Design: SCD, Indexing, Partitioning, Query Optimisation
Databases: PostgreSQL, MySQL, Hive, NoSQL(MongoDB, CassandraDB, HBase, CosmoDB)
BigData Engineering & Core Compute