Summary
Overview
Work History
Education
Skills
Affiliations
Languages
Websites
Certification
Timeline
Generic

Gaurav Singh

Hyderabad

Summary

Data Scientist with 11+ years of experience in providing data-driven solutions to increase efficiency, accuracy in both product as well as consulting. Working as Senior Data Scientist at Relience Jio.


Key accomplishments includes

  • Designed a LLM based system that reduced manual effort and third-party cost for attribute mapping across different vendors for the Ajio platform by 62% and decreased processing time by 95%.
  • Designed and implemented a cattle health monitoring system that enabled owners to take timely action, resulting in improved livestock health, reduced veterinary costs, and increased revenue by 27%.
  • Designed and developed a crop anomaly detection system that accurately identified potential issues such as pest infestations, nutrient deficiencies, and water stress. This system enabled farmers to take timely corrective actions, reducing crop losses by 25% and increasing overall yield by 18%, thereby improving their profitability.
  • Implemented deep learning-based solutions and integrated it to Oracle Cloud Infrastructure(OCI), which enhanced the OCI's capabilities.
  • Developed a road sign detection and geo-tagging system leveraging camera calibration techniques. This solution accurately identified and localized road signs, providing precise geo-coordinates for navigation and mapping applications. The system improved road safety and navigation efficiency by enhancing map data accuracy and enabling real-time updates. It reduced manual work by 90%.
  • Developed a lidar data visualization and classification tool with C++, which has reduced the cost of paid tools.
  • Implemented Home Credit Risk Analysis system using machine learning models to assess customer creditworthiness. The solution improved risk prediction accuracy by 20%, enabling more informed lending decisions and reducing default rates.
  • Mentored 7 interns in Jio, Mentored hundreds of students/professionals in (Scalar) from basics of ML to building real world ML systems.
  • Highly skilled in Applied Machine Learning/ Deep Learning, Computer Vision and Language Processing

Overview

11
11
years of professional experience
1
1
Certification

Work History

Sr. Data Scientist

Reliance Jio Platefarm Ltd
Hyderabad
09.2021 - Current

Designed and implemented a cattle activity tracking system.

  • Automatically analyze behavioral patterns from videos, and improve decision-making processes.
  • Following solution components: cattle detection, tracking, occlusion handling, a centralized server, and video stream were implemented.
  • Dashboard for activity insights and alerts.

Farm geofencing.

  • Detected farms and other land types using Mask R-CNN.
  • Extracted the boundary of each detected mask and calculated the latitude and longitude of each pixel boundary.
  • CUDA, C++ programming, and multithreading are used to optimize services, which have improved response time by 70%.

Pest, disease, and deficiency detection in crops.

  • Create one classifier model and one detection model (Mask R-CNN) for each crop. Classifier model for healthy and anomaly classification, and detection of anomalies.
  • Used TensorFlow Serving to serve the model, which has improved the response time of the service by 50%.

Personalized Advisory to Farmers.

  • Provides personalized and context-aware agricultural advisory to farmers.
  • Fine-tuned Llama2 model for chat response, context processing, and created RAG from the appropriate advisory database.

Attribute mapping of retail products.

  • Fully fine-tuned/trained two LLM-based models, one for identifying the category of products, and the other to map attribute values into the format of AJIO.
  • Realized cost savings of 62% in manual mapping processes.

Sr. Data Scientist

Oracle India Pvt. Ltd.
Hyderabad
04.2020 - 09.2021

Researched and implemented smart image cropping for product cataloging.

  • Automatically identify and crop the most relevant portions of an image, enhancing visual focus, and improving optimized and efficient content for OCI cloud infrastructure.
  • Used saliency detection, edges using Laplace, and regions high in saturation.

Innovated methods to streamline automatic document structure.

  • Automatically organizes text documents into appropriate directories based on content and context.
  • Created a transformer-based BERT model.

Enhanced searchability of products in an e-commerce-based application on OCI infrastructure.

  • Enhancing visual content understanding, metadata generation, and better tagging of images for e-commerce products.
  • Used image caption technique.

Technical Lead Computer Vision

Cyient Ltd.
Hyderabad
03.2019 - 04.2020

Research on object detection in point cloud/LiDAR data for autonomous vehicles.

  • Developed cutting-edge solutions for identifying and classifying objects in three-dimensional environments.
  • Used deep learning and linear algebra-based features from LiDAR data to create a new deep learning model.
  • Tested other SOTA algorithms, like PointNet++ and SVM.

Road Sign Detection and Geo-Tagging Using Camera Calibration.

  • To automatically detect road signs from images or video streams captured by cameras installed on vehicles or traffic infrastructure.
  • This technology has significant implications for road safety, traffic management, and navigation systems.

Lidar data visualization and classification tool.

  • Engineered a C++ tool enabling the visualization and automatic classification of LiDAR data points.
  • In tool development, C++ and its libraries, OpenGL, GDAL, etc., have been used.

Software Engineer in Machine Learning

Barclays
Pune
01.2018 - 03.2019

Employee identity verification using facial recognition and ID numbers with existing records in the database.

  • Implemented an advanced verification system combining facial recognition and ID number checks.
  • Pre-indexing embeddings with approximate nearest neighbor (ANN) search is used for fast processing.
  • Facenet and OCR have been used for face and ID number, respectively.

Implemented the Home Credit Risk Analysis system.

  • Assessing the creditworthiness of customers and recommending credit positivity based on their credit records from different portfolios.
  • Designed features including credit utilization ratio, days past due metric, income-to-debt ratio calculation, and portfolio diversity index evaluation.
  • Implemented a classification and regression model to predict risk levels and scores of continuous credit risk levels, respectively.

Applied Machine Learning Engineer

Synechron Technology Pvt. Ltd.
Pune
01.2017 - 01.2018

Credit card transaction fraud detection.

  • Developed system to detect fraudulent transactions instantly and accurately, reducing false positives.
  • Train models using historical labeled data (fraudulent vs non-fraudulent).
  • Applied XGBoost, LightGBM, Random Forest algorithms as well as CNN and RNN deep learning models for handling sequential data.

Parking lot detection for satellite images POC.

  • Implemented a POC to identify parking lots, and counted the number of available parking spaces.
  • Used OpenCV for thresholding, Canny edge detector, contour detection

Data Analyst

Nihilent Technology Pvt. Ltd.
Pune
08.2013 - 01.2017

Developed a rule-based TV channel suggestion system.

  • To provide viewers with personalized channel package recommendations based on predefined rules and user preferences.
  • Curated content tailored to past viewing history and genre preferences.

Statistical Analysis of Channel Demands.

  • To understand the demand for products or services through different distribution channels.

Backend development of a web-based TV channel subscription application.

  • Worked on backend development and used Java, Spring Boot.
  • Special package subscription module.

Education

Bachelor of Technology - Computer Science And Engineering

Jay Pee Institute of Information Technology
Noida, UP
06-2013

Skills

Computer Vision

Activity Classification, Image Processing, Classification, Object Detection, Image Segmentation, Image Captioning, Video Analytics, Face Recognition, OCR, CNN, ResNet, VGG, InceptionV3, DenseNet, Faster RCNN, Mask RCNN, YOLO, Face net, TSN, Uniformer2, ViT


NLP

Abstractive Summarization, Topic Modeling, NER, Key Phrase extraction, Personal Advisor, RNN, LSTM, TF-IDF, Tokenizer


Generative AI-EDSRGAN, SRGAN, LLM: BERT, RoBerTa, Flan-T5, GPT 3, MIXTRAL, Llama 31, 32, vLLM, sglang, Huggingface, Unstoth


Classical ML – Regression, Ensemble learning, Clustering Algorithms, Feature Engineer, Data Wrangling etc


Frameworks & Libraries
– Pytorch, Tensorflow, Keras, Scikit-Learn, Opencv, Pandas, Numpy, NLTK, Pytorch-C, TensorRT, ONNX, transformer, huggingface, mmdetection, mmaction etc


MLOps – TFX, MLflow


Cloud and Other Tech – AWS (SageMaker, S3, EC2, Lambda), GCP(VertexAI, GCS), Docker, Kubernetes, Fast API, Flask, Kafka, Multithreading, SQL, MongoDB, Postgress


Programming – Python, C, C, CUDA, Java

Affiliations

  • Travel Enthusiast
  • Mentorship
  • Sports

Languages

Hindi
First Language
English
Proficient (C2)
C2

Certification

  • Deep Learning Specialization
  • Generative AI Specialization

Timeline

Sr. Data Scientist

Reliance Jio Platefarm Ltd
09.2021 - Current

Sr. Data Scientist

Oracle India Pvt. Ltd.
04.2020 - 09.2021

Technical Lead Computer Vision

Cyient Ltd.
03.2019 - 04.2020

Software Engineer in Machine Learning

Barclays
01.2018 - 03.2019

Applied Machine Learning Engineer

Synechron Technology Pvt. Ltd.
01.2017 - 01.2018

Data Analyst

Nihilent Technology Pvt. Ltd.
08.2013 - 01.2017

Bachelor of Technology - Computer Science And Engineering

Jay Pee Institute of Information Technology
Gaurav Singh