Experienced Data Platform & Solution Architect with a background in Computer Science and Engineering. Proficient in data modeling, ETL processes, and SQL with expertise in architecture, team leadership, and various ETL tools. Skilled in big data technologies, cloud platforms, and infrastructure. Seeking a role as a Data Architect to leverage my diverse skill set and drive successful data projects.
Overview
10
10
years of professional experience
1
1
Certification
Work History
Data Platform & Solution Architect
Physics Wallah
Noida
07.2023 - Current
Architected a central data platform using Medallion Architecture (Bronze, Silver, Gold) for scalable data processing.
Built a scalable data ingestion framework using JDBC, Debezium, Kafka, and Spark (batch and real-time), reducing latency by 40%, achieving 99.9% uptime, and enabling real-time reporting for more than 5 business units.
Optimized analytics infrastructure by implementing Trino with Iceberg on S3 and dbt, cutting BigQuery query costs by 85% monthly.
Led cross-functional data teams (ETL and platform, 15+ engineers) to drive architecture, governance, and scalable, high-performance data pipelines, delivering measurable business impact.
Senior Principal Engineer
Bharti Airtel (Wynk)
Gurugram
02.2022 - 06.2023
Built a scalable user feature store using Spark, powering real-time, personalized song recommendations for 50M+ users, and actively leveraged by the Data Science team.
Led data warehouse modernization (Redshift, BigQuery) to enhance reporting and decision-making.
Led the implementation of GDPR-compliant data governance and IAM controls, and developed audit pipelines with Deequ and Great Expectations, reducing data loss by 30% and enabling proactive alerts for quality anomalies across critical systems.
Collaborated with engineering, product, and analytics teams to drive data-driven strategies and innovation.
Generated engineering documentation and oversaw design projects.
Lead Data Engineer
Paytm (One97 Communication)
05.2018 - 02.2022
Led data platform migration from in-house to AWS Cloud, migrating 100+ TB of data to Glacier and 300+ ETL jobs to the new platform.
Built and led a 10-member team (DAAS & Platform), optimizing OLAPs, data models, and ensuring a scalable, production-grade platform.
Designed HLDs & LLDs for system performance, Apache Atlas (Lineage), GDS, logging, schema generation, and volumetric analysis.
System Engineer / Data Engineer
Tata Consultancy Services
12.2015 - 05.2018
Developed Big Data pipelines for clients, enabling scalable feature creation and analytics.
Created Hive tables to manage structured data from UNIX, Oracle, and multiple portfolios, optimizing storage and retrieval.
Built BI reports using multi-threaded Sqoop, exporting data to HDFS and Hive for analysis.
Education
B.Tech - Computer Science and Engineering
Jaypee Institute of Information And Technology
Noida, India
07-2015
Senior Secondary Education -
Sir Padampat Singhania Education Center
Kanpur, India
03-2010
High School Education -
B.N.S.D. Siksha Niketan Inter College
Kanpur, India
03-2008
Skills
Data modeling: OLTP, OLAP, star/snowflake, medallion architecture
<ul><li>Roles and Responsibilities</li><li>Hiring and team building</li><li>Managing team’s career progression and performance reviews</li><li>Managing Performance improvement plan and firing</li><li>Mentoring</li><li>Product roadmap planning and execution</li><li>Accountability of the entire delivery execution including the risk mitigation planning.</li><li>Stakeholder management</li><li>Adhering to Apple Privacy and Security Guidelines for platform</li><li>Design and implementation of new architectures and solutions to manage all online services for AdXchange in Apple Ads.</li><li>Design and implementation of new architectures and solutions deliver Budget experimentation in Apple Ads</li><li>Design and implementation of new architectures and solutions to manage data for Ad Platform.</li><li>Creating and maintaining all the data and analytics and revenue pipelines for Ad Platforms</li><li>Driving the solution for data ingestions and data processing across Ad platform</li><li>Defining the Strategy for data governance and data access</li><li>Collaboration with business to deliver relevant data and insight for our strategy and decisions.</li><li>Managing storage, processing, copy/synchronization etc. appropriate to scale</li><li>Enabling machine learning and algorithm groups by proving them right frameworks and data</li><li>Advancing team’s design methodology and quality programming practices and evangelize those techniques across Ad platform.</li><li>Deliveries</li><li>Setup 8+ member team to design and deliver Budget Allocation and Budget experimentation.</li><li>Leading Ad Delivery services team, responsible for delivering various capabilities like query understanding, targeting, fraud etc.</li><li>Delivered and Lead Design for observability for budget, and data platform in Ad platform.</li><li>Setup 11+ members high performing team to handle various business needs such as analytics and micro services for Display Advertising as well as Data Platform</li><li>Instrumental in driving Data Platform throughout the SDLC from inception, design, development, testing and deployment.</li><li>Instilled a strong team culture of high-quality delivery by constructive feedback, cross collaboration, focus and motivation.</li><li>Delivered Realtime and Batch Platform with various integrations</li><li>Delivered new DIP design for Data Platform</li><li>Delivered design of all the components of Data processing platform</li><li>Transitioned Display data work to India seamlessly and delivered many critical business impacting features and organizational security initiatives.</li><li>Closely worked with various stakeholders to understand the requirements which resulted into efficient remote work culture.</li><li>Seamless Display execution and set up best practices for 24X7 production on-call.</li><li>Tools and Technologies</li><li>AWS, S3, Kafka, Spark, Hadoop, EMR, Hive, Scoop, Flume, Cassandra, Airflow, Oozie, Gobblin, Schema Store, Icloud, Grafana, Kubernetes, Vertica, Alation, snowflake, druid, Iceberg, Java, Scala, IntelliJ, Oracle, MySql, Git, RIO, Spinnaker, Elastic cache, Grpc, Agile.</li></ul> at Apple - Adplatform<ul><li>Roles and Responsibilities</li><li>Hiring and team building</li><li>Managing team’s career progression and performance reviews</li><li>Managing Performance improvement plan and firing</li><li>Mentoring</li><li>Product roadmap planning and execution</li><li>Accountability of the entire delivery execution including the risk mitigation planning.</li><li>Stakeholder management</li><li>Adhering to Apple Privacy and Security Guidelines for platform</li><li>Design and implementation of new architectures and solutions to manage all online services for AdXchange in Apple Ads.</li><li>Design and implementation of new architectures and solutions deliver Budget experimentation in Apple Ads</li><li>Design and implementation of new architectures and solutions to manage data for Ad Platform.</li><li>Creating and maintaining all the data and analytics and revenue pipelines for Ad Platforms</li><li>Driving the solution for data ingestions and data processing across Ad platform</li><li>Defining the Strategy for data governance and data access</li><li>Collaboration with business to deliver relevant data and insight for our strategy and decisions.</li><li>Managing storage, processing, copy/synchronization etc. appropriate to scale</li><li>Enabling machine learning and algorithm groups by proving them right frameworks and data</li><li>Advancing team’s design methodology and quality programming practices and evangelize those techniques across Ad platform.</li><li>Deliveries</li><li>Setup 8+ member team to design and deliver Budget Allocation and Budget experimentation.</li><li>Leading Ad Delivery services team, responsible for delivering various capabilities like query understanding, targeting, fraud etc.</li><li>Delivered and Lead Design for observability for budget, and data platform in Ad platform.</li><li>Setup 11+ members high performing team to handle various business needs such as analytics and micro services for Display Advertising as well as Data Platform</li><li>Instrumental in driving Data Platform throughout the SDLC from inception, design, development, testing and deployment.</li><li>Instilled a strong team culture of high-quality delivery by constructive feedback, cross collaboration, focus and motivation.</li><li>Delivered Realtime and Batch Platform with various integrations</li><li>Delivered new DIP design for Data Platform</li><li>Delivered design of all the components of Data processing platform</li><li>Transitioned Display data work to India seamlessly and delivered many critical business impacting features and organizational security initiatives.</li><li>Closely worked with various stakeholders to understand the requirements which resulted into efficient remote work culture.</li><li>Seamless Display execution and set up best practices for 24X7 production on-call.</li><li>Tools and Technologies</li><li>AWS, S3, Kafka, Spark, Hadoop, EMR, Hive, Scoop, Flume, Cassandra, Airflow, Oozie, Gobblin, Schema Store, Icloud, Grafana, Kubernetes, Vertica, Alation, snowflake, druid, Iceberg, Java, Scala, IntelliJ, Oracle, MySql, Git, RIO, Spinnaker, Elastic cache, Grpc, Agile.</li></ul> at Apple - Adplatform