Summary
Overview
Work History
Education
Skills
Certification
Timeline
Hi, I’m

MAHIPAL SINGH RATHORE

Data Scientist
MAHIPAL SINGH RATHORE

Summary

Experienced Developer with 4.5 years of expertise in designing and implementing scalable data-intensive solutions, driven by a passion for solving real-world challenges. Committed to following industry-standard quality and best practices, including test-driven development, CI/CD, and documentation maintenance. Possesses exceptional algorithmic and analytical skills that inform decision-making. Looking forward to contributing my expertise to your team.

Overview

5
years of professional experience
5
years of post-secondary education
2
Certifications

Work History

Blue Altair

Data Scientist
12.2022 - Current

Job overview

Blue Altair | Duration: 1 Year

  • Question Answering AI System: Developed a robust Question Answering AI system, specializing in extracting data (tables and text) from PDFs, Docs, and Text files and enables users to receive intelligent answers, Which significantly enhances data accessibility for users.
    Demonstrated proficiency in utilizing tools such as Streamlit, Haystack, and Langchain. Designed and implemented the backend in Python using Flask APIs for the Question Answering AI system. Developed robust APIs to load AI models and facilitated interaction with different tools, ensuring seamless integration.
    Applied expertise in leveraging large language models, including GPT-3, BERT, T5, etc., to optimize generative or extracted answers.
    Implemented end-to-end solutions for efficient storage and retrieval of data in vector databases like Milvus.
  • ETL Pipelines and Azure Integration: Led the development of ETL pipelines for ingesting diverse flat files from sources like Sharepoint and OneDrive.
    Utilized Azure Data Factory for seamless data orchestration and transfer.
    Employed Azure Function for efficient preprocessing of files, enhancing the overall data processing workflow.

TCS Digitate

Data Scientist / Python Developer
06.2019 - 12.2022

Job overview

Digitate | Duration: 3.5 Years

  • Auto AI Platform Development: Played a key role in the development of an Auto AI platform aimed at automating the creation and deployment of machine learning pipelines.
    Engineered pipelines for diverse datasets, including text, images, and statistical data.
    Designed and implemented pipelines for text and structured data, incorporating essential steps such as Exploratory Data Analysis (EDA), Training, and Model Inference.
    Leveraged MLflow to seamlessly manage the end-to-end machine learning lifecycle.
  • Streamlined backend development using Python and Flask APIs to facilitate seamless interaction between Java and AI pipelines. Employed RabbitMQ for efficient queue management, ensuring effective communication between Java components and AI pipelines. Orchestrated a Python server to handle requests from Java components, call specific pipelines, and track completion. Managed memory and space utilization for pipelines, queuing calls when necessary resources were unavailable. Integrated Flask APIs for streamlined communication with EDA pipelines and result sharing among components.
  • Technological Expertise: Utilized a broad spectrum of ML frameworks, including Tensorflow, PyTorch, Transformers, scikit-learn, MXNet, AutoSklearn, T-pot, and Auto-Weka.
    Implemented Explainable AI using SHAP (Shapley Additive Explanations) to provide transparent insights into the model's decision-making process.
  • Performance Evaluation and Maintenance: Employed drift detection techniques to assess model performance over an extended period.
    Conducted timely model retraining to ensure sustained accuracy and relevancy.
  • End-to-End Deployment on Azure: Orchestrated the complete deployment of machine learning pipelines on Azure using tools such as Docker and Jenkins.
  • Specialized Pipelines: Engineered specialized pipelines tailored for Procurement at Tata Chemicals and automated of HR operations.

Education

ARMY PUBLIC SCHOOL
JODHPUR

from 12TH IN SCIENCE AND MATHEMATICS
03.2013 - 04.2014

University Overview

ARMY INSTITUTE OF TECHNOLOGY
PUNE

from COMPUTER ENGINEERING
06.2015 - 05.2019

University Overview

Skills

    Machine learning frameworks : TensorFlow, Keras, PyTorch, scikit-learn, XGboost, Hyper-opt, ML-Flow, etc

undefined

Certification

Coursera :Deep Learning Specialization

Timeline

Data Scientist
Blue Altair
12.2022 - Current
Data Scientist / Python Developer
TCS Digitate
06.2019 - 12.2022
ARMY INSTITUTE OF TECHNOLOGY
from COMPUTER ENGINEERING
06.2015 - 05.2019
ARMY PUBLIC SCHOOL
from 12TH IN SCIENCE AND MATHEMATICS
03.2013 - 04.2014
MAHIPAL SINGH RATHOREData Scientist