Driven Data Engineer skilled at creating scalable data solutions that turn complex challenges into clear business value. Known for building reliable pipelines, optimizing workflows, and collaborating effectively to deliver impactful results in dynamic environments.
Overview
5
5
years of professional experience
Work History
Data Engineer
Yugen Analytics Private Limited
05.2022 - 03.2025
Informant: RAG-Based System Design Engine
Developed a Python library integrating LLMs, Airflow, and Milvus to automate GitHub-based system design querying. Built a complete data pipeline to ingest, chunk, embed, and retrieve information. Tech: Python, Apache Airflow, LLM, Milvus, S3
GitOperationManager: LLM-Powered GitHub Automation
Implemented automated Git operations using LLM-suggested GitHub API methods with function calling, streamlining PR and issue workflows. Tech: Python, GitHub API, LLM
Market Intelligence Platform (MIP)
Built real-time ETL pipelines for Bungie & Scopely, processing gaming data for 4.5M+ users. Leveraged Airflow & Cloud Composer with monitoring dashboards for scalable deployment. Tech: Python, SQL, BigQuery, Cloud Composer, Docker, Looker Studio, Airflow
Fabulate – Influencer Discovery Engine
Designed a low-latency influencer search system using GCP. Built scalable ETL pipelines and high-speed APIs with sub-second response times. Tech: Python, SQL, BigQuery, ElasticSearch, Cloud Functions, API Gateway
Data Engineer
L & T Infotech
07.2020 - 05.2022
World Bank – IBRD Loan Analysis
Built distributed ETL pipelines using Spark to analyze cancelled loan data. Integrated MySQL with HDFS/S3 via Sqoop; implemented Kafka for streaming and Tableau for visualization. Tech: Python, Spark, Kafka, MySQL, Sqoop, HDFS, S3, Hive, Tableau
Enterprise Data Migration
Contributed to large-scale data migration projects using Talend, focusing on ETL integrity, validation, and transformation. Tech: Talend, SQL, ETL
Education
B.Tech - CSE
IIIT Guwahati
01.2016 - 1 2020
Skills
Programming & Scripting:
Publications
BDmark: Implementing Blockchain for Big Data Watermarking, Asian Conference on Intelligent Information and Database Systems, Springer, Singapore, 2020
Enhancing Security in Cooperative Fog Computing: A Framework for Secure Task Loading, GLOBECOM 2020 - IEEE Global Communications Conference, Taipei, Taiwan, 2020
Certificates And Trainings
Cloud Computing Training, NPTEL, 2019
Core Java Training, Internshala, 2018
Talend Data Integration V7 Developer Certification, 2021
Summer Training on PHP, Niks Technology, Patna, 2017
Insights Analytics Manager, Advanced Analytics at Commonwealth Bank of AustraliaInsights Analytics Manager, Advanced Analytics at Commonwealth Bank of Australia
Lead SERVICE ANALYST - RESILIENCY MANAGER at Societe Generale Global Solutions Center Pvt Ltd, Bangalore | Bangalore, INDIALead SERVICE ANALYST - RESILIENCY MANAGER at Societe Generale Global Solutions Center Pvt Ltd, Bangalore | Bangalore, INDIA