
Staff-level data engineering leader with 14+ years of experience architecting and scaling enterprise-grade data platforms across GCP and AWS. Proven track record of designing petabyte-scale lakehouse solutions, optimizing cross-cloud ETL pipelines, and enabling real-time analytics.Adept at driving org-wide data architecture decisions, building compliant frameworks to deliver cost-efficient, highly reliable systems. Skilled at mentoring engineers into senior roles and influencing technical strategy at the leadership level.
Project: Learning Portal (Client: Cornerstone OnDemand).
Business Impact
1. Architected and delivered a next-generation data platform on AWS Glue, DBT, and BigQuery, migrating from a multi-tenant to a single-tenant DB architecture, achieving a 70% reduction in latency for the UI, and enabling scalable, client-specific analytics.
2. Designed and led the development of an event-driven, real-time pipeline using Kafka and BigQuery to compute leaderboard scores with a 1-hour near-real-time SLA, powering mission-critical product features for thousands of enterprise users.
3. Built an automated GDPR-compliant data deletion framework across AWS and GCP, eliminating manual operations, and ensuring 100% regulatory compliance across client organizations.
1. Defined and implemented a real-time monitoring framework for CDC connectors, publishing health metrics to BigQuery and integrating with Tableau dashboards, thus reducing ETL outages by 50% by catching failures early and taking action promptly.
2. Optimized Airflow orchestration with dynamic scheduling, cutting pipeline delays by 50%, and accelerating the delivery of business-critical dashboards.
3. Reduced cloud costs by 25% via performance analysis and resource tuning in BigQuery, Cloud Run, and Cloud Functions.
4. Performed Elasticsearch optimization by migrating excess data of over 500 GB from a single shard to multiple shards using cluster re-indexing, and reduced bottlenecks for data ingestion.
5. Strategize export, import jobs to migrate data from production to lower environment for data evaluation.
1. Led architecture discussions with business analysts, cross-team leads, and platform architects, leading to a robust design and considering strong business use cases.
2. Mentored a 4-member team, established an early demo, and fostered a fail-fast culture, influenced roadmap priorities, and accelerated project delivery timelines.
Key Technologies: Spark, Airflow, dbt, BigQuery, Glue, Delta Lake, Pub/Sub, ElasticSearch, Cloud Run, Kubernetes, Python, and SQL.
Project: CRM Sales Fusion (SaaS Application) | Product Development | Global Enterprise Clients
Key Technologies: Java, Spring, REST, SQL, Oracle DB, Kubernetes, CI/CD (Jenkins), Oracle Cloud-native microservices