Summary
Overview
Work History
Education
Skills
Languages
Summary Of Key Projects Handled
Personal Information
Projects
Timeline
Generic

Ansuman Das

Bangalore

Summary

A seasoned Data Scientist with over 9 years of expertise Generative AI and Machine Learning, including specialties in LLM, Deep Learning, and MLOPS within the Google Cloud Platform (GCP). Google Cloud certified Machine Learning Engineer. End to end Model deployment and data pipeline experience on GCP Platform. Experience in Team Management and stakeholder Management.

Overview

11
11
years of professional experience

Work History

Lead Data Scientist

Ericsson India
04.2021 - Current

Manager, Analytics and Machine Learning

HSBC Global Analytics Center
04.2019 - 11.2020

Manager, Advanced Analytics

Tata Steel
04.2018 - 01.2019

Data Scientist

Infosys
06.2016 - 03.2018

Machine Learning Engineer

Phenom People
10.2015 - 03.2016

Associate Consultant (Analytics Domain)

Capgemini
07.2013 - 10.2015

Education

B. Tech - Computer science

IIIT Bhubaneswar
01.2013

+2 Science -

College of Basic Science and Humanities
01.2009

10th -

BSE Odisha Board
01.2007

Skills

  • Generative AI
  • Large Language Model
  • LLM Finetuning
  • Langchain Framework
  • Python
  • R
  • Machine Learning
  • Regression Techniques
  • Classification Models
  • Deep Learning (ANN, CNN, RNN)
  • Dimensionality Reduction
  • Advanced Natural Language Processing
  • Transformers
  • Time series Models
  • GCP Vertex AI
  • GCP Big Query
  • Teradata
  • SQL server

Languages

English
Hindi
Oriya
Bengali

Summary Of Key Projects Handled

  • Interacting ChatBot Using Large Language Model (LLM) for Network Operators
  • Customer Complain Category Prediction using LLM .
  • Real Time Telco network performance Monitoring using Machine Learning Algorithm
  • End to End Tracking of Customer's digital Journey for retail banking.
  • Predicting Customer Realistic Total Wallet and Share of wallet Using Quantile Regression Methodology
  • Predicting Invoice Payment Delay Probability Using Different Algorithms
  • Similar Job Suggestion Using Topic Modeling

Personal Information

  • Date of Birth: 07/12/92
  • Nationality: Indian

Projects

1. Utilized Hugging Face's pretrained opensource LLM to convert natural language text into SQL queries, employing various models such as llama2, codeLlama 34B, codeLlama 7B, and Mistral 7B. Implemented diverse prompt techniques for enhanced SQL query accuracy. Integrated Chroma vector database and RAG to store domain and database information, facilitating dynamic schema generation.  Executed the generated SQL queries on a Vertica database, obtaining answers for subsequent delivery to network operators via a interactive user interface.  

Models and Framework used : CodeLama 7B,13B,34B, T5 Spyder,Mistral 7B,Prompt engineering (Zero shot,Fewshot and COT)RAG,Langchain

2. We finetuned small LLMs, including TinyLLM, Phi-1.5, and Facebook/Opt-1.3b, using domain-specific data to classify customer complaints effectively. When a user submits a complaint, our finetuned LLM classifies it into a specific category, enabling the resolution team to address issues promptly. We employed both adopter and lora methodologies for this task and implemented quantization mechanisms to optimize model loading and memory usage. The finetuned models exhibited a significant 25% increase in accuracy compared to the base model, showcasing their effectiveness in multi-class classification tasks.

3.Developed a predictive model for dynamic threshold values of Radio Access Network (RAN) Key Performance Indicators (KPIs), utilizing clustering techniques for network cell grouping. The model predicts thresholds across 2G, 3G, 4G, and 5G KPIs, incorporating boosting-based multioutput regression and statistical anomaly detection algorithms. Deployed on Google Cloud Platform (GCP) and Vertex AI, the model includes a live data comparison system generating support tickets. Network operators rely on the Real-Time Performance Monitoring (RTPM) tool for day-to-day operations, deployed across various clients, showcasing its versatility. Dashboard creation leverages Tableau and Google Data Studio.

Models and Framework used: Multioutput Regression, Anomaly detection, GCP Vertex AI, Bigquery, BigqueryML, MLops,catboost algorithm

4. Implemented a quantile regression model for a prominent US office stationary supplier, predicting B2B customers' realistic total wallet and Share of Wallet (SOW) based on previous year sales and demographics. The model clusters customers based on their share of wallet, providing valuable insights for strategic decision-making.

5. Designed a predictive model for three clients to forecast invoice payment delay probabilities, identifying critical invoices with higher outstanding amounts and a greater likelihood of late payment in the future. Leveraged various invoice and customer-level variables in the model creation process. 

Models and Framework used Logistic Regression, Random Forest. Employed R for modeling, Tableau for visualization, and PostgreSQL for data storage.


Timeline

Lead Data Scientist

Ericsson India
04.2021 - Current

Manager, Analytics and Machine Learning

HSBC Global Analytics Center
04.2019 - 11.2020

Manager, Advanced Analytics

Tata Steel
04.2018 - 01.2019

Data Scientist

Infosys
06.2016 - 03.2018

Machine Learning Engineer

Phenom People
10.2015 - 03.2016

Associate Consultant (Analytics Domain)

Capgemini
07.2013 - 10.2015

B. Tech - Computer science

IIIT Bhubaneswar

+2 Science -

College of Basic Science and Humanities

10th -

BSE Odisha Board
Ansuman Das