Prafull Sharma

Senior Data Scientist
Paris

Summary

I have managed ML-powered products for IBM, Siemens, and Amadeus, as well as for startups in Europe and Asia, and have a proven track record in international environments. My open-source NLP contributions and a widely cited research paper (11k+ downloads) reflect this expertise. I have developed language models from scratch and am passionate about open-source projects.

Overview

9
years of professional experience

Work History

Senior Data Scientist

Knowdis.ai
08.2021 - Current
  • Leading product development of AI products across diverse domains such as e-commerce and AdTech.
  • Communicated with C-level executives on the client side (large MNCs) to understand their tech needs and requirements.
  • Use cases: search engines using NLP and multimodal models, recommendation engines, search query attribute extraction, search question understanding, search query typo correction, etc.
  • Leading the development of in-house, multi-million-parameter language models pretrained from scratch for the AdTech and e-commerce domains.
  • Developed decoder-based generative language models for the AdTech and e-commerce domains, using both our own pretrained model and recent LLMs (Mixtral, Flan, instruction-tuned LLMs).
  • Developed search engine and recommendation models using Transformer-based multimodal LLMs.
  • Developed RAG-based engines using LlamaIndex, LangChain, OpenAI assist, and an in-house proprietary RAG scheme for domain augmentation.
  • Deployment: led the deployment of LLMs in production, including quantisation and speed optimisation of models.

Head of Engineering (AI and Product)

Mieux.ai
12.2021 - 12.2022
  • Mieux.ai is a Paris- and UK-based startup specializing in AI-generated content solutions (text and video).
  • Developed generative language models that produce SEO-optimised content for product catalogs, landing pages, Google Ads, and Facebook Ads.
  • Joined as an early co-founding member, serving as Head of Engineering and tech for the AI product suite.
  • Built and managed the tech team, growing it from 3 to 15 data scientists and frontend engineers.
  • Created technical specifications and helped execute them to realize the product.
  • Helped scale AI models and infrastructure to make the product stable and production-ready.

Senior Data Scientist (AdTech)

Naister SaS
04.2019 - 07.2021
  • Led development of data science features in the product.
  • Owned client product-related communications with UK and US clients.
  • Led the full-stack and data science teams for product development.
  • Worked with upper management to understand client requirements and create solutions around them.
  • Hired and managed a tech team of data scientists and UX designers for the development of SaaS products.
  • Published open-source NLP keyword extraction technology that served as a source technology for the popular Python NLP packages KeyBERT and BERTopic.
  • Title: 'Self-Supervised Contextual Keyword and Keyphrase Retrieval with Self-Labelling'

Research Asst - Machine Learning

Amadeus France
09.2018 - 02.2019
  • Developed machine learning solutions for the flight search engine product.
  • Built GBM, random forest, and decision tree models to improve the flight-search product.
  • Performed feature engineering and selection for machine learning solutions.
  • Skills used: machine learning, Python, NumPy, Pandas, scikit-learn, H2O.ai, Flask, Hadoop.
  • Worked on search engine recommendation scoring and ranking using machine learning and deep learning.

Software Engineer

SIEMENS
01.2017 - 02.2018
  • Developed a question-answering cognitive chatbot for a Human-Machine Interface (HMI) product using an LSTM encoder-decoder architecture, NLP, deep learning, and Rasa NLU.
  • Developed text classification models for log data and customer interaction data.
  • Practiced test-driven development, with unit testing of modules to ensure performance.
  • Handled front-end development and back-end integration of models.
  • Skills used: Python, HTML, CSS, TensorFlow.

Software Engineer - Python, Big Data

IBM
05.2015 - 12.2016
  • Migrated large volumes of data from traditional databases to the Hadoop ecosystem.
  • Extracted data from other data sources into HDFS using Sqoop.
  • Imported data from various data sources and worked on a cloud data orchestrator.
  • Developed a churn-model machine learning solution for IBM's clients.

Education

MSc in Applied Data Science & Artificial Intelligence

DSTI
Paris

Skills

  • Python
  • PyTorch
  • NLP
  • AWS
  • Model Deployments

Timeline

Head of Engineering (AI and Product)

Mieux.ai
12.2021 - 12.2022

Senior Data Scientist (AdTech)

Naister SaS
04.2019 - 07.2021

Research Asst - Machine Learning

Amadeus France
09.2018 - 02.2019

Software Engineer

SIEMENS
01.2017 - 02.2018

Software Engineer - Python, Big Data

IBM
05.2015 - 12.2016

Senior Data Scientist

Knowdis.ai
08.2021 - Current

MSc in Applied Data Science & Artificial Intelligence

DSTI