Summary

Overview

Work History

Education

Skills

Timeline

Ankit Pandey

Pune

Summary

Data Engineer with over 11 years of experience in developing scalable, cloud-native data systems on AWS and GCP. Expertise in modern data platforms including DBT, Snowflake, Airflow, and Terraform, complemented by strong programming skills in Python, Scala, and Apache Spark. Focused on delivering efficient data pipelines and reliable data products that enhance analytics and support decision-making processes.

Overview

years of professional experience

Work History

Senior Data Engineer

Equal Experts

Pune

07.2023 - Current

Building a modern, cloud-native data platform on AWS for one of the largest sports leagues in the U.S., integrating diverse data sources into a centralized data layer, and delivering curated data products for Marketing, Analytics, and other downstream teams.
Developed scalable ingestion pipelines using AWS Glue, Lambda, SQS, and Kinesis, supporting both batch and streaming data.
Designed and implemented the transformation layer using DBT on Redshift, orchestrated via Airflow, to serve trusted datasets for audience segmentation, campaign performance, and fan engagement analytics.
Contributed to establishing data quality standards and conventions within the team, promoting test coverage, documentation, and governance across DBT models.
Collaborated with cross-functional teams to translate business needs into reliable, production-grade data assets, ensuring quality, performance, and governance through CI/CD, and Terraform automation.

Lead Data Engineer

Clairvoyant EXL

Pune

05.2021 - 07.2023

Create optimal data pipeline architecture and systems by assembling large, complex data sets that meet functional and business requirements on AWS Cloud using EMR, EC2, Glue, Athena, Kinesis Firehose, Step Functions, Lambda, CloudWatch, etc.
Ingest data into the data lake, and develop custom frameworks for ELT, ETL, and services for operating on that data with the use of PySpark and Scala Spark. Pandas, SQL, etc.
Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, redesigning infrastructure for greater scalability, etc.
Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using Python, Scala, Spark, AWS, Terraform, CI/CD, etc.

Technology Analyst

Edgeverve Systems

Dubai, Pune

05.2014 - 05.2021

Part of the digital transformation and modernization of one of the largest financial institutions in the Middle East for building a scalable core data platform.
Built generic frameworks for data ingestion, data transformation using Python and Scala by implementing the TDD approach.
Worked on the enhancement of data quality and existing ETL flows for legacy frameworks.
Write complex Spark jobs for processing huge volumes of data.
Customize traditional data management practices of data quality and data governance for the adoption of AWS Cloud.

Education

Bachelor of Engineering -

Madhav Institute of Technology And Science

Gwalior, MP

04-2014

Skills

AWS
GCP
Python
Scala
DBT

Snowflake
Terraform
Airflow
SQL
CICD

Timeline

Senior Data Engineer

Equal Experts

07.2023 - Current

Lead Data Engineer

Clairvoyant EXL

05.2021 - 07.2023

Technology Analyst

Edgeverve Systems

05.2014 - 05.2021

Bachelor of Engineering -

Madhav Institute of Technology And Science

Ankit Pandey

Summary

Overview

Work History

Senior Data Engineer

Lead Data Engineer

Technology Analyst

Education

Bachelor of Engineering -

Skills

Timeline

Senior Data Engineer

Lead Data Engineer

Technology Analyst

Bachelor of Engineering -

Similar Profiles

Shuvamoy MondalShuvamoy Mondal

Ram KollaRam Kolla

Charan CharamallaCharan Charamalla

PAVAN KUMAR REDDY ANNACHEDUPAVAN KUMAR REDDY ANNACHEDU

Hammad YasirHammad Yasir