Summary
Overview
Work History
Education
Skills
Certification
Interests
Timeline
Generic

Tanya Gupta

Bangalore

Summary

Data Engineer with over 5 year track record of leading cloud migration and data lake initiatives for enterprise-scale environments. Proficient in Snowflake, GCP, AWS, and modern ETL frameworks, with expertise in data modeling, PII compliance, and performance optimization. Experienced in mentoring teams, implementing DevOps automation, and delivering governance and playbooks for long-term data strategy.

Overview

5
5
years of professional experience
1
1
Certification

Work History

Lead Data Engineer

DataGrokr Analytics Private
01.2025 - Current
  • Led end-to-end design and development of a scalable AWS data lake following Well-Architected principles.
  • Architected multi-layered storage (Raw, Secure, Staging, Normalized, Reporting) using S3, Iceberg, Snowflake, and Athena.
  • Built metadata-driven ETL pipelines with AWS Glue, Lambda, and Step Functions for JSON data ingestion and normalization.
  • Implemented PII handling with secure raw storage and hashed identifiers for compliance.
  • Enabled self-service analytics by replicating Power BI dashboards and optimizing Snowflake queries.
  • Developed KPI monitoring and automated alerts using metadata-driven thresholds.

Data Engineer

DataGrokr Analytics Private
08.2020 - Current
  • Created data pipelines, developed stored procedures, and tested data with Snowflake while streaming data from Kafka.
  • Managed data history and successfully executed a large-scale data warehouse migration project from LDW to EDM, ensuring data integrity and performance optimization within the supply chain ecosystem.
  • Developed and maintained data models for ThoughtSpot dashboard to analyze customer behavior trends, providing valuable insights to management.
  • Collaborated on diverse projects, including implementing trend-specific attributes in analytical dashboards, working with cross-functional teams to gather requirements, and building streaming data pipelines within GCP services - Dataflow and BigQuery.

Education

Bachelors of Technology (B.Tech) - Computer Science

Chitkara University
07.2021

Semester Exchange - undefined

Western Sydney University
12.2018

Skills

  • Cloud : GCP (BigQuery, Composer, Dataflow), AWS
  • Databases/ETL : MySQL, Big Query, Databricks, Snowflake
  • Analytical Tools : PowerBI, ThoughtSpot
  • Programming Languages : Python, Shell Scripting, Spark, Scala, JavaScript
  • Others : Git, CICD/DevOps, GenAI, ML, Solution Architecture

Certification

  • Google Certified Professional Data Engineer - 2023-09
  • Databricks Lakehouse Fundamentals - 2023-05
  • Machine Learning (Andrew Ng) | Stanford Univ - 2021-12

Interests

Traveling and exploring food

Experimenting with emerging tech like LLM.

Timeline

Lead Data Engineer

DataGrokr Analytics Private
01.2025 - Current

Data Engineer

DataGrokr Analytics Private
08.2020 - Current

Semester Exchange - undefined

Western Sydney University

Bachelors of Technology (B.Tech) - Computer Science

Chitkara University
Tanya Gupta