Rishu Roshan

Data Engineer
Bengaluru

Summary

Driven Data Engineer skilled at creating scalable data solutions that turn complex challenges into clear business value. Known for building reliable pipelines, optimizing workflows, and collaborating effectively to deliver impactful results in dynamic environments.

Overview

5 years of professional experience

Work History

Data Engineer

Yugen Analytics Private Limited
05.2022 - 03.2025
  • Informant: RAG-Based System Design Engine
    Developed a Python library integrating LLMs, Airflow, and Milvus to automate GitHub-based system design querying. Built a complete data pipeline to ingest, chunk, embed, and retrieve information.
    Tech: Python, Apache Airflow, LLM, Milvus, S3
  • GitOperationManager: LLM-Powered GitHub Automation
    Implemented automated Git operations using LLM-suggested GitHub API methods with function calling, streamlining PR and issue workflows.
    Tech: Python, GitHub API, LLM
  • Market Intelligence Platform (MIP)
    Built real-time ETL pipelines for Bungie & Scopely, processing gaming data for 4.5M+ users. Leveraged Airflow & Cloud Composer with monitoring dashboards for scalable deployment.
    Tech: Python, SQL, BigQuery, Cloud Composer, Docker, Looker Studio, Airflow
  • Fabulate – Influencer Discovery Engine
    Designed a low-latency influencer search system using GCP. Built scalable ETL pipelines and high-speed APIs with sub-second response times.
    Tech: Python, SQL, BigQuery, ElasticSearch, Cloud Functions, API Gateway

Data Engineer

L & T Infotech
07.2020 - 05.2022
  • World Bank – IBRD Loan Analysis
    Built distributed ETL pipelines using Spark to analyze cancelled loan data. Integrated MySQL with HDFS/S3 via Sqoop; implemented Kafka for streaming and Tableau for visualization.
    Tech: Python, Spark, Kafka, MySQL, Sqoop, HDFS, S3, Hive, Tableau
  • Enterprise Data Migration
    Contributed to large-scale data migration projects using Talend, focusing on ETL integrity, validation, and transformation.
    Tech: Talend, SQL, ETL

Education

B.Tech - CSE

IIIT Guwahati
01.2016 - 01.2020

Skills

Programming & Scripting:

Publications

  • BDmark: Implementing Blockchain for Big Data Watermarking, Asian Conference on Intelligent Information and Database Systems, Springer, Singapore, 2020
  • Enhancing Security in Cooperative Fog Computing: A Framework for Secure Task Loading, GLOBECOM 2020 - IEEE Global Communications Conference, Taipei, Taiwan, 2020

Certificates And Trainings

  • Cloud Computing Training, NPTEL, 2019
  • Core Java Training, Internshala, 2018
  • Talend Data Integration V7 Developer Certification, 2021
  • Summer Training on PHP, Niks Technology, Patna, 2017
  • Blockchain IoT Workshop Certification, 2019
