
Kratagya Yadav

Gurgaon

Summary

Data Engineer with a proven track record of transforming raw data into valuable assets. Expertise includes developing efficient data pipelines, implementing data warehouses, managing databases, and migrating systems to the cloud. Additionally, I possess strong skills in data analytics, SQL, and Power BI, enabling me to extract meaningful insights and create compelling visualizations.

Overview

7 years of professional experience
1 Certification

Work History

Data Engineer

Fidelity Investments
08.2020 - Current
  • Designed and implemented comprehensive fact and dimension models spanning multiple fact and dimension tables.
  • Engineered a robust data pipeline capable of efficiently managing 10M+ transactions daily across production databases.
  • Managed over 125 TB of production data, achieving a 50% reduction in both data load and query times.
  • Tech Stack: Python, JavaScript, Snowflake, SQL, Control-M, Airflow, AWS Kinesis, AWS S3

Executive Graduate Trainee

Fidelity Investments
08.2019 - 07.2020
  • Implemented a comprehensive data quality framework and developed automated daily checks to ensure data accuracy and integrity.
  • Created a Power BI dashboard for monitoring data quality, integrating data from Snowflake with Change Data Capture (CDC) to provide real-time insights.
  • Tech Stack: JavaScript, Snowflake, SQL, Power BI

Data Science Intern

InMobi
05.2018 - 07.2018
  • Developed new features for CVU (conversion rate) prediction using a matrix factorisation method and users' app ownership profiles.
  • Created a 9-bit unique profile for millions of users by analysing their app usage patterns, and used that profile for a targeted ad experience.
  • Pre-processed and analysed many terabytes of data (including data from all ad networks, such as Google and Facebook) with Apache Spark, and applied different machine learning models to it using MLlib.
  • Tech Stack: Spark, Scala, Python

Hydrometallurgy of Radioactive Minerals

Department of Atomic Energy
05.2017 - 06.2017
  • Analysed various radioactive minerals and calculated the percentage of different radioactive elements in the samples.
  • Performed heavy mineral separation using techniques such as concentrating table, bromoform separation, and isodynamic separator.

Education

B.Tech. - Metallurgical & Materials Engineering

Indian Institute of Technology
Roorkee, India
05.2019

Senior Secondary -

Neerja Modi School, Jaipur (CBSE)
Jaipur
01.2015

Skills

  • ETL development
  • Data Warehousing
  • Data Pipeline Design
  • Data Migration
  • SQL and Databases
  • Power BI

Certification

  • Natural Language Processing with Probabilistic Models
  • Natural Language Processing with Classification and Vector Spaces
  • Sequence Models
  • Structuring Machine Learning Projects
  • Neural Networks and Deep Learning
  • Snowflake Basics - cloud data warehouse

Projects

Minimizing CO2 Production from Blast Furnace Using Machine Learning

Metallurgical and Materials Engineering, IIT Roorkee
01.2018 - 03.2018
  • Predicted CO2 production based on factors such as temperature, pressure, and input materials, with the aim of minimizing it.

Conversion Rate Prediction for Ads

Data Science Intern | InMobi
05.2018 - 07.2018
  • Developed new features for CVU (conversion rate) prediction using a matrix factorization method and users' app ownership profiles.
  • Created a 9-bit unique profile for millions of users by analysing their app usage patterns, and used that profile for a targeted ad experience.
  • Pre-processed and analysed many terabytes of data (including data from all ad networks, such as Google and Facebook) with Apache Spark, and applied different machine learning models to it using MLlib.
  • Worked on US region data and analysed user response to a particular ad by age group, handset version, demand market ID, etc.
  • Tech Stack: Spark, Scala, Python

Established the Need for a Recommendation System for Glance (an InMobi Product)

05.2018 - 07.2018
  • Analysed the state-wise news viewing patterns of Samsung and Gionee handset users.
  • Created a categorical news viewing experience from the analysed past-usage data.
  • Tech Stack: Spark, Scala, Python

XTRAC Datalake

Data CoE, Fidelity Investments
01.2020 - Current
  • Developed the architecture for the fact and dimension model in Snowflake.
  • Wrote queries to populate the Raw and Prepared zones.
  • Implemented tokenization of data during data movement and storage.
  • Built a generic function for load and performance testing to calculate latency.
  • Optimized queries so that data moves between the landing and prepared zones within a minute.
  • Created a framework to check for data loss between source and Snowflake.
  • Built an automated check that validates data between source and Snowflake at regular intervals.
  • Onboarded multiple BUs onto the XTRAC datalake, such as WI, FILI and FI/PI.
  • Implemented mapping logic to map workitem data to workitem history data.

Olympus Datalake

FDA, Fidelity Investments
02.2021 - Current
  • Developed the queries for data movement to the prepared zone.
  • Developed Python code for data movement, scheduled using Airflow DAGs.

Data Quality Framework

FDA, Fidelity Investments
01.2021 - 10.2021
  • Implemented a data quality framework and wrote data quality checks that run daily.
  • Developed a data quality dashboard in Power BI that consumes data from Snowflake on a CDC basis.
