
RAKESH SAI DHEERAJ M

Hyderabad

Summary

  • 7.6 years of experience in Reporting, Predictive Analytics, Machine Learning, Natural Language Processing, Artificial Intelligence, and Generative AI (GenAI).
  • Good knowledge of machine learning techniques such as Decision Trees, Random Forest, Logistic Regression, and Predictive Modelling, along with LLM-based applications such as Retrieval-Augmented Generation (RAG).
  • Proficient in deep learning modules, Prompt Engineering, LangChain, LangGraph, and agent-based frameworks like CrewAI.
  • Skilled in analyzing raw data for normality, collinearity, and outliers, carrying out Descriptive Statistical Analysis, and integrating AI insights into automated workflows using n8n.
  • Good programming knowledge in Python, with experience in building and deploying RESTful APIs using Flask.
  • Hands-on experience in data cleaning and imputation, feature engineering, and preparing data pipelines for ML and GenAI applications.

Overview

8 years of professional experience
1 Certification

Work History

Data Scientist

InnovaMind Technologies Pvt. Ltd
Hyderabad
01.2023 - Current

Project: Enterprise Knowledge Assistant (GenAI)

Domain: Enterprise Knowledge Management

  • Developed a Generative AI–based knowledge assistant to enable users to query large volumes of internal documents and obtain context-aware, accurate responses.
  • Implemented Retrieval-Augmented Generation (RAG) pipelines to fetch relevant information from document repositories and enhance LLM responses with grounded context.
  • Worked on document ingestion and preprocessing pipelines supporting multiple formats such as PDF, DOC, PPT, TXT, and CSV.
  • Designed and managed LLM workflows using LangChain and LangGraph, enabling structured reasoning, tool calling, and controlled response generation.
  • Applied Prompt Engineering techniques to improve answer accuracy, reduce hallucinations, and ensure consistent response formatting.
  • Built and exposed Flask-based REST APIs for query handling, document retrieval, and response generation.
  • Integrated automation workflows using n8n to orchestrate document updates, indexing, and background processing tasks.
  • Collaborated with stakeholders to align the assistant’s responses with business-specific terminology and knowledge requirements.

Data Scientist

UV Infratech System pvt ltd.
Hyderabad
09.2021 - 12.2022

Project: AI-powered Question Generation Platform

Domain: Learning & Assessment Platform

  • Worked on developing a web application–based AI platform that accepts documents in multiple formats such as PDF, CSV, PPT, DOC, TXT, and other structured and unstructured inputs.
  • Designed and implemented Generative AI–based pipelines to automatically generate Multiple Choice Questions (MCQs) from uploaded documents.
  • Built logic to generate questions at multiple difficulty levels (Easy, Medium, Advanced) based on content complexity and learning objectives.
  • Integrated LLM-driven workflows with document parsing, content chunking, and contextual understanding for accurate question generation.
  • Developed and exposed Flask-based REST APIs to handle document ingestion, processing, and question generation.
  • Applied Prompt Engineering and Retrieval-Augmented Generation (RAG) techniques to improve relevance, accuracy, and contextual consistency of generated questions.
  • Collaborated with frontend and product teams to ensure seamless integration of AI services into the web application.

Data Scientist

AB- Consultancy
08.2020 - 08.2021

Project: Research on HR management and predictive modelling in the HR sector

Client: Internal project (POC)

Domain: HR Solutions

  • Performed EDA and produced distribution plots to understand the distribution of the features in the input data.
  • Derived new features using feature engineering techniques and performed feature transformation.
  • Developed an end-to-end classification model.

Data Scientist

Zara
10.2019 - 07.2020

Project: Server Blocking Prediction

Domain: Retail Industry

  • Developed deep learning models to predict whether a server would be blocked within the next 25 minutes, based on the features given in the data.
  • Built Python scripts using the developed models and delivered them to the client to run on their servers.
  • Performed EDA to understand the distribution of the features and identified features whose distributions separate the output classes.
  • As part of EDA, built several plots using Plotly and Cufflinks to demonstrate the feature distributions to the client.
  • Applied various feature engineering techniques and derived a few features from the given data that carried more weight in the model.

Data Scientist

Counterfeit
Hyderabad
06.2018 - 09.2019

Domain: Medicine Manufacture Company

  • Performed EDA and produced distribution plots to understand the distribution of the features in the input data.
  • Derived new features using feature engineering techniques and performed feature transformation.
  • Developed an end-to-end regression model and delivered it to the client.
  • Created a Flask API for the developed model.

Education

B.Tech - Electronics and Communication Engineering

Intell Engineering College
Anantapur
04.2015

Intermediate - Mathematics

Narayana Junior College
Anantapur
04.2011

Skills

  • Machine Learning
  • Data Science
  • Natural Language Processing
  • Deep Learning
  • Artificial Intelligence
  • Python
  • Predictive Analytics
  • Logistic Regression
  • Predictive Modelling
  • Descriptive Statistical Analysis
  • Data Cleaning
  • Imputation
  • Prompt Engineering
  • Generative AI
  • LangChain
  • LangGraph
  • RAG
  • n8n

Certification

Deep learning and Neural Networks, Coursera

Profile Summary

To secure a responsible position in Statistical Analytics in the field of Data Science, Machine Learning, and Artificial Intelligence, in order to utilize my technical and analytical skills to contribute to the long-term and short-term goals of the company.

Machine Learning, Data Science, Natural Language Processing (NLP), Deep Learning, Artificial Intelligence, Python

  • 7.6 years of experience in Reporting, Predictive Analytics, Machine Learning, Natural Language Processing, and Artificial Intelligence.
  • Good knowledge of machine learning techniques such as Decision Trees, Random Forest, Logistic Regression, and Predictive Modelling.
  • Proficient in deep learning modules.
  • Skilled in analyzing raw data for normality, collinearity, and outliers, and in carrying out Descriptive Statistical Analysis.
  • Good programming knowledge in Python.
  • Hands-on experience in data cleaning and imputation.

Timeline

Data Scientist

InnovaMind Technologies Pvt. Ltd
01.2023 - Current

Data Scientist

UV Infratech System pvt ltd.
09.2021 - 12.2022

Data Scientist

AB- Consultancy
08.2020 - 08.2021

Data Scientist

Zara
10.2019 - 07.2020

Data Scientist

Counterfeit
06.2018 - 09.2019

B.Tech - Electronics and Communication Engineering

Intell Engineering College

Intermediate - Mathematics

Narayana Junior College