Sayantani Nath

Senior Data Engineer (Information Architect - Modeler)

Summary

Data architecture professional with 13 years of experience in delivering mission-critical data solutions for banking, healthcare, and manufacturing industries. Demonstrated ability to convert complex business needs into efficient data architectures, resulting in significant performance enhancements. Skilled in cloud-native data warehouse design, data pipeline orchestration, and comprehensive data governance practices. Proven leadership in architecture standards and team mentoring, fostering collaboration among diverse teams.

Overview

13 years of professional experience

Work History

Lead Data Architect

Hexaware Technologies (IQVIA – Bayer)
  • Global Data Model for healthcare analytics — standardised enterprise data architecture across 15+ programmes globally.
  • Global Data Model: Owned IQVIA GDM — standardised enterprise definitions across 15+ healthcare analytics programmes globally; delivered full STM documentation with SCD strategies and loading patterns.
  • OLAP Design: Designed star/snowflake schemas for OLAP workloads on Snowflake; profiled data on Hive/Impala to validate model correctness and enforce data quality standards at global scale.
  • Database Performance Optimisation: Analysed and optimised query performance across Hive/Impala analytical clusters; tuned Snowflake warehouse sizing and clustering strategies for multi-geography healthcare reporting workloads; achieved 35% query latency reduction.
  • Data Architecture Documentation: Designed source-to-target mappings and data lineage architecture using ERwin and MagicDraw; created enterprise data dictionary defining conformed dimensions and fact structures across healthcare domains.
  • Data Vault POC: Prototyped Data Vault 2.0 hub-satellite-link architecture for integrating multiple ERP sources — validated design patterns for auditability and scalability in multi-source environments.
  • Stakeholder Alignment: Translated healthcare reporting requirements from 10+ global business teams into governed logical models with full traceability from concept to physical implementation.

Senior Data Reporting & Visualisation Engineer

The Dow Chemical Company
  • OLAP Warehouse: Kimball-style dimensional models for manufacturing analytics — conformed dimensions, factless facts, aggregate fact tables; served 200+ users with sub-5-second query SLAs on multi-billion-row datasets.
  • Database Performance Optimisation: Designed advanced Oracle/SQL Server schemas with optimised clustered/non-clustered indexes and partition strategies for high-throughput manufacturing data ingestion; implemented execution plan tuning for 30+ critical reporting queries.
  • ETL & Data Pipeline: Optimised data pipeline performance and pipeline orchestration; managed complex ETL workflows with embedded data quality controls for multi-million row daily loads.

Senior Software Engineer / Data Engineer

FundsIndia
  • OLTP Architecture: Designed relational DB schema for a robo-advisory platform on AWS RDS; handled ETL automation, query tuning, and backup/recovery for a fintech investment product.
  • Database Performance Optimisation: Tuned critical trading and portfolio evaluation queries on AWS RDS PostgreSQL; optimised transaction processing for high-frequency operations with sub-second latency requirements.

Senior Consultant / Database Engineer

Virtusa Polaris (Citibank – FX eDealer)
  • FX Trading Platform: High-throughput OLTP stored procedures for 500K+ daily FX transactions across 100+ countries — optimised critical query paths reducing deal matching latency by ~30% for a tier-1 bank.
  • Database Performance Optimisation: Engineered high-performance transaction processing schemas; optimised Oracle execution plans for real-time FX pricing queries; implemented connection pooling and caching strategies for sub-millisecond response times in mission-critical trading systems.

AVP / Senior Data Architect

Citi Corp (CSIPL)
11.2022 - Current
  • Enterprise data architecture across DCRM, CEAM, Metrics & Dataverse — analytics, regulatory reporting, and API-driven consumption for a global bank.
  • Data Modelling: Designed conceptual → physical models across 4 domains; defined authoritative grains, referential integrity, and reusable patterns adopted by 10+ engineering squads.
  • OLTP & OLAP: Designed and optimised transactional and analytical schemas concurrently — millions of records/day across 3 source systems; ~40% reduction in source-to-mart latency via schema refactoring and index redesign.
  • Database Performance Optimisation: Rewrote 20+ high-cost SQL queries; implemented clustering and partition-pruning strategies on Snowflake — ~45% storage reduction, ~50% query runtime improvement on high-volume fact tables; optimised index structures across Oracle and SQL Server schemas.
  • Snowflake Warehouse: Star/snowflake schemas, SCD Type 2 (MERGE + effective-date ranges), surrogate key frameworks — supports regulatory reporting with full historical traceability.
  • Reporting Layer: Built 40+ SQL views and materialised views on Snowflake powering Tableau dashboards consumed by business and regulatory reporting users across the Metrics domain, covering KPI scorecards, trend analyses, and reconciliation reports with sub-3-second refresh SLAs.
  • Data Architecture Documentation: Created comprehensive source-to-target mappings, data dictionary, and data lineage documents using ERwin and MagicDraw — enabling cross-functional teams to understand enterprise data flows and governance rules across domains.
  • Data Vault POC: Designed and prototyped Data Vault 2.0 architecture for a regulatory reporting use case — demonstrated scalability, auditability, and flexibility advantages; informed roadmap for enterprise-wide adoption.
  • ETL/ELT Pipeline Design: Architected and implemented ETL/ELT pipelines using Python for data extraction, transformation, and loading from multiple source systems into Snowflake; optimised incremental load patterns with full idempotency and SCD compliance.
  • Data Quality: Automated validation framework (null/duplicate/referential checks + anomaly alerting) — >40% reduction in defect escape rate across regulatory reporting pipelines.
  • Consumption Layer: Kafka → Redis → GraphQL API architecture serving 10+ application teams; eliminated legacy DB-polling patterns; standardised low-latency governed data access.
  • Leadership: Design authority for enterprise modelling standards; mentored junior data architects across squads on Kimball, Data Vault 2.0, Snowflake, and CI/CD practices.

Education

B.Tech - Computer Science & Engineering

National Institute of Technology
Agartala
05.2013

Skills

  • Advanced SQL
  • Python programming
  • ETL development
  • Data security
  • Data modeling
  • Performance tuning
  • Data warehousing
  • NoSQL databases
  • Master and Reference Data
  • Kafka streaming
  • Distributed systems
  • Hadoop ecosystem
  • Data governance
  • Normalization techniques
  • Database structures
  • Data architecture
  • Data lineage
  • Data warehousing expertise
  • Database design
  • Data quality management
  • Database optimization
