Jithin N

Data Scientist
Bangalore

Summary

Data Science Professional with 7+ years of experience, holding a Postgraduate degree in Data Science & Business Analytics from Austin Texas University, and a Bachelor's in Electronics and Communication Engineering. Certified in Microsoft Azure Generative AI (Great Learning) and completed an ML internship at WorldQuant University. Proficient in leading ML and Generative AI projects, with deep expertise in engineering data analytics. Skilled at deriving actionable insights from complex datasets, with a strong focus on innovation and continuous improvement. Eager to apply my skills in a dynamic, forward-thinking Data Science role.

Overview

9 years of professional experience
3 Certifications

Work History

Data Scientist

Arcolabs
02.2024 - Current
  • LLM Document Automation: Streamlined document creation processes with Mistral LLM, improving efficiency and reducing manual effort.
    Technologies: Mistral, Python, PyTorch.
  • RAG-based RCA Chatbot: Designed a chatbot using a RAG system to retrieve and summarize RCA documents, speeding up root cause analysis.
    Technologies: LLaMA, GPT, FastAPI, and Elasticsearch.
  • Analyst Allocation Model: Implemented a classification model with XGBoost to optimize analyst assignments, increasing task efficiency based on performance metrics.
    Technologies: XGBoost, Python, Scikit-learn, Jupyter Notebook.

Assistant Manager - Data Science (Engineering)

Jubilant Pharmova Ltd.
03.2022 - 06.2023
  • Drug Classification Model: Developed a model to classify drugs based on chemical properties and therapeutic use, improving classification accuracy by 15%.
    Technologies: Python, XGBoost, Scikit-learn.
  • Predictive Maintenance: Built a Random Forest model to predict oxygen analyzer maintenance needs, evaluated with recall and F1-score.
    Technologies: Python, Scikit-learn.
  • Real-Time Analytics Dashboard: Created a Tableau dashboard for live data tracking and actionable insights in manufacturing.
    Technologies: Tableau, SQL, Excel, and Python.

Data Science Consultant

Ideobiz InfoTech
12.2019 - 01.2022
  • Inventory Management Optimization: Implemented a Decision Tree classification model to optimize inventory management in the retail industry. Used feature engineering and cross-validation to improve model accuracy, resulting in better stock control and an annual cost saving of R2.3 million.
    Technologies: Python, Decision Tree, SQL.
  • Tableau Dashboard for Delivery Optimization: Used regression analysis to identify bottlenecks in the delivery system, leading to a 25% reduction in delivery time. Built interactive Tableau dashboards to monitor delivery performance and extract actionable insights.
    Technologies: Tableau, SQL, Python.

Junior Officer - Engineering (Data Analytics)

Apotex PharmaChem PVT Ltd
06.2015 - 06.2018
  • Customized Analytics Reports: Created automated reports in Tableau and Excel, using SQL for real-time data monitoring and process optimization.
    Technologies: Tableau, SQL, Excel.

Education

Post Graduation - Data Science And Business Analytics

Great Lakes
04.2001 -

Bachelor of Engineering Technology - Electronics & Communications

T John Institute of Technology
04.2001 -

Skills

Tools: Tableau, MySQL, KNIME, Excel, Docker
IDE: Jupyter Notebook, Google Colab, PyCharm
Machine Learning: Classification, Regression, Linear & Logistic Regression, Decision Trees, Random Forest, NLP, Boosting & Bagging
Statistical Methods: T-test, Chi-Square, PCA, Hypothesis Testing, ANOVA
Methods: Data Wrangling, Missing & Outlier Treatment, Univariate & Bi-Variate Analysis, Bootstrap & Cross-Validation
API Development: FastAPI, Flask, Streamlit
Cloud Platforms: Microsoft Azure, AWS SageMaker
Version Control: Git/GitHub
Deployment: Docker, AWS Lambda, Azure Functions
Generative AI & Prompt Engineering: RAG-based systems, summarization tasks with LLaMA, GPT, Mistral

Projects - ML

  • A/B Testing at WorldQuant University

Conducted an experiment to determine if sending reminder emails to applicants increases the likelihood of completing the admission exam.
Skills & Tools: Chi-Square Test, Odds Ratio, CDF, ETL, MongoDB, Exploratory Data Analysis (EDA)


  • Volatility Forecasting in India

Developed a GARCH time series model to predict asset volatility. Acquired stock data via API, performed data cleaning, stored it in a SQLite database, and built an API to serve real-time model predictions.
Skills & Tools: GARCH Model, SQL, REST API, Walk Forward Validation (WFV), ACF & PACF Plots


  • Air Quality in Nairobi

Skills & Tools: Linear Regression, Decision Tree, Random Forest, Boosting Techniques, EDA


  • Earthquake Damage in Nepal

Skills & Tools: Clustering, PCA, Data Mining, K-Nearest Neighbors, Decision Tree

Projects - GenAI

  • RAG-based Chatbot

Developed a chatbot using a Retrieval-Augmented Generation system for accurate document retrieval and response generation.
Technologies: Mistral-Nemo, Streamlit.


  • Document Summarization

Built a tool for summarizing PDFs and Word documents using fine-tuned LLMs to generate concise summaries.
Technologies: LLaMA, PyTorch.


  • AI Report Generation

Developed an automated report generator to extract business insights from datasets using fine-tuned LLMs.
Technologies: LLaMA, Tableau, Excel.


  • Sentiment Analysis Tool

Built a sentiment analysis and summarization tool for customer reviews, providing actionable insights.
Technologies: GPT, NLTK.

Certification

Applied Data Science Lab, Machine Learning - (WorldQuant University)

Timeline

GenAI Microsoft Azure - (Great Learning)

09-2024

Data Scientist

Arcolabs
02.2024 - Current

Applied Data Science Lab, Machine Learning - (WorldQuant University)

11-2023

Assistant Manager - Data Science (Engineering)

Jubilant Pharmova Ltd.
03.2022 - 06.2023

Data Science Consultant

Ideobiz InfoTech
12.2019 - 01.2022

Junior Officer - Engineering (Data Analytics)

Apotex PharmaChem PVT Ltd
06.2015 - 06.2018

Post Graduation - Data Science And Business Analytics

Great Lakes
04.2001 -

Bachelor of Engineering Technology - Electronics & Communications

T John Institute of Technology
04.2001 -