
SHUBHAM SAXENA

Pune, MH

Summary

Data Scientist with an M.S. in Data Science from IIT Indore and IIM Indore, skilled in machine learning, NLP, and data analytics. Proficient in Python, R, and SQL, with experience building data models, implementing machine learning algorithms (Linear Regression, Logistic Regression, SVM, Random Forest), applying deep learning architectures (CNN, RNN, LSTM, Transformers), and performing statistical modeling (Hypothesis Testing, A/B Testing, Time Series Analysis, Multivariate Statistics with Principal Component Analysis) and advanced analytics.

Work History

Data Scientist

ibrow Technology Pvt. Ltd. (Client- Google)
04.2024 - Current
  • Worked on Implicit Code Execution (ICE) for Gemini, enhancing the model's ability to understand and execute code snippets embedded in natural language prompts.
  • Contributed to the development and optimization of this feature, improving its accuracy and versatility for supervised fine-tuning of LLMs.

Associate Specialist - Data Science Intern

Merck, Sharp & Dohme (MSD)
  • Built an NLP pipeline that extracts the dataframe name, X and Y axis labels, and chart type from a natural language input and automatically creates a PowerPoint slide from them.
  • Used transfer learning with a BERT model, an NER model, and fuzzy matching.

Executive Engineer

Honda
- 03.2022
  • Spearheaded cross-functional efforts with the Supplier Quality Assurance Team to raise quality standards for sheet metal and engine parts.
  • Analyzed production data (defect rates, production volumes, and supplier quality metrics) to identify patterns and root causes of defects, leading to targeted improvements in the manufacturing process and a 10% decrease in post-development defects.

Education

Master of Science - Data Science And Management

IIT Indore & IIM Indore
03.2024

B.E in Mechanical -

Thapar University, Punjab
06.2018

Skills

  • Programming languages: Python, R
  • Data analysis and manipulation: NumPy, Pandas, SQL, NLTK, spaCy, SciPy
  • Data visualization: Matplotlib, Seaborn, Plotly, Tableau, Power BI
  • Machine learning: scikit-learn, TensorFlow, Keras
  • Deep learning: CNN, RNN, LSTM, GRU, Transformers, Topic Modeling, PyTorch, TensorFlow, Generative AI (in progress)
  • Natural language processing (NLP): NLTK, spaCy, Transfer Learning (BERT, Llama), NER, POS, Topic Modeling, Text Preprocessing, Chatbot
  • Collaboration tools: Jupyter Notebooks, Google Colab, Git, RWDEx
  • Database management: MySQL, AWS Redshift
  • Cloud computing: AWS SageMaker, S3, AWS Certification (in progress)
  • Other skills: A/B Testing, Marketing, Communication, Team Management

Activities

  • Semi-Finalist, Nationwide Analytics Case Study Competition, IIT Kanpur, 2022
  • Sports and Gym Head of Hostel Committee, 2017

Roles And Responsibilities

  • Class Representative, IIT Indore & IIM Indore, 03/22/2022 - 03/24/2024
  • Sports & Gym Head, Thapar University, 07/15/2015 - 07/16/2016

Projects

BERTified Sentiments: Aspect-Based Sentiment Analysis (ABSA) with BERT & LDA
  • Developed a sentiment analysis system that goes beyond traditional sentiment analysis by identifying and analyzing the sentiments associated with specific aspects of a text, using a financial news dataset.
  • Used BERT and Latent Dirichlet Allocation (LDA) to identify latent topic categories and the sentiments associated with them, achieving an accuracy of 87%.
  • Tools: Python, BERT, LDA, Transfer Learning

Autocategorization for Customer Grievance Redressal
  • Created a GPT-3.5 LLM system that auto-categorizes customer grievances for efficient redressal and redirects each complaint to the respective department or Ministry of the Government of India.
  • Tools: OpenAI API, LLM, LDA, BERT

Item-Item Collaborative Recommender System
  • Built an item-item collaborative recommender system on a Books dataset and deployed it using Flask.
  • Tools: Python, Statistics

Predictive Model for Healthcare Industry
  • Built a predictive model of treatment cost for a hospital using Elastic Net Regression.
  • Tools: Python, Linear Regression, Statistics, Predictive Modeling, LASSO Regularization, Ridge Regularization
