Aditya Kumar Pandey

London, India

Summary

Dynamic Data Scientist with over 3 years of experience, specializing in advanced machine learning and natural language processing solutions. Expertise includes designing multi-modal retrieval systems and developing sophisticated models that enhance data extraction and contextual understanding. Committed to driving impactful results through data-driven strategies and collaborative problem-solving.

Overview

3 years of professional experience
5 years of post-secondary education
1 Certification

Work history

Data Scientist

Capital Numbers
Gurugram, India
02.2025 - 09.2025
  • Designed the system architecture for a multi-modal Retrieval-Augmented Generation (RAG) solution in logistics using GCP services and Vertex AI.
  • Implemented document parsing and chunking strategies to optimize structural data extraction.
  • Increased content generation efficiency for clients by 60%, minimizing manual document processes.
  • Collaborated with cross-functional teams to ensure project outcomes aligned with business objectives.

Data Scientist

Wissen Research
Mohali, India
09.2024 - 01.2025
  • Achieved 70% reduction in manual tasks for research teams through automation.
  • Designed and implemented advanced prompt engineering techniques with OpenAI's GPT-4 for document information extraction.
  • Developed robust REST APIs using Flask for seamless integration of machine learning models.
  • Created proofs of concept (POCs) leveraging LLMs for document analysis and comparison.

Data Scientist

Jupitice Justice Technology
Chandigarh
01.2024 - 08.2024
  • Developed a chatbot using an OpenAI large language model with Retrieval-Augmented Generation.
  • Automated processes, reducing manual work for legal teams by 60%.
  • Created advanced document and speech summarization models using Large Language Models.
  • Optimised information retrieval processes with a TF-IDF model, enhancing efficiency by 50%.

Data Scientist

Cognida.ai
Hyderabad, India
08.2022 - 01.2024
  • Engineered a super-resolution model with PyTorch, increasing image resolution 20-fold.
  • Developed a customised Named Entity Recognition model using BERT for logistics applications.
  • Created RESTful API with Django to facilitate seamless integration of developed models.

Education

Master of Science - Data Science

London School of Economics and Political Science
London, UK
09.2025 - 09.2026

Bachelor of Technology - Computer Science and Engineering

REVA University
Bengaluru, KA
08.2018 - 06.2022

Skills

  • Python
  • NumPy
  • Pandas
  • Scikit-learn
  • Regression (Linear Regression, Decision Tree)
  • Classification (Logistic Regression, Random Forest, XGBoost)
  • NLP (Transformers, BERT)
  • Deep Learning (CNN, LSTM, RNN)
  • TensorFlow
  • PyTorch
  • Databricks
  • AWS
  • Seaborn
  • Matplotlib
  • SQL
  • Docker
  • CI/CD Pipeline
  • MLflow
  • Flask
  • FastAPI

Certification

  • Python For Data Science - IBM
  • Machine Learning with Python - IBM
  • Statistics and Machine Learning for Data Science and AI - Learnbay
  • Generative AI with LLM - Coursera
  • SQL For Data Science - Coursera
  • Microsoft Certified: Azure Data Fundamentals

Personal Projects

Bank Direct Marketing System - Machine Learning

  • Led the implementation of a Bank Direct Marketing System predictive model utilizing Python programming and supervised machine learning techniques for targeted customer outreach.
  • Developed a predictive model using the Random Forest algorithm and performed data analysis, data processing, Exploratory Data Analysis (EDA), and Data Visualization using Seaborn and Matplotlib on the dataset.
  • Demonstrated the efficacy of algorithms such as Logistic Regression and XGBoost on the same task.


Grammar Correction - NLP

  • Built a grammar correction model that reviews spelling, grammar, punctuation, clarity, and delivery mistakes in English texts and corrects the identified errors.
  • The project seeks to enhance the quality and professionalism of content by leveraging Python and Named Entity Recognition (NER) techniques.

Timeline

Master of Science - Data Science

London School of Economics and Political Science
09.2025 - 09.2026

Data Scientist

Capital Numbers
02.2025 - 09.2025

Data Scientist

Wissen Research
09.2024 - 01.2025

Data Scientist

Jupitice Justice Technology
01.2024 - 08.2024

Data Scientist

Cognida.ai
08.2022 - 01.2024

Bachelor of Technology - Computer Science and Engineering

REVA University
08.2018 - 06.2022