Experienced Data Scientist with nearly 4 years in AI, specializing in efficient data pipelines and machine learning solutions. Proficient in Python, PySpark, SQL, and Airflow, with expertise in integrating REST APIs and designing ML pipelines for robust deployment. Skilled in experiment tracking using Weights & Biases (W&B).
Focused on advanced Generative AI (GenAI) applications, including Retrieval-Augmented Generation (RAG) with Large Language Models (LLMs) for financial document QA retrieval and table parsing.
Experienced in training deep learning models with PyTorch and developing ensemble models with hyperparameter tuning using Optuna and Hyperopt.
Strong background in MLOps and FEOps, committed to delivering innovative solutions in both structured and unstructured data domains.
Financial Prediction Service (Sep 2023 – Nov 2024)
QIPP Product (June 2022 – Aug 2023)
GENERATIVE AI Use Cases
Financial Document Parsing Solution
COMPUTER VISION USE CASE
Computer Vision Use Case:
Overview: Developed end-to-end solutions for defect detection in manufacturing and enhanced data visualization for flight simulations.
Steel Defect Detection:
Built a pipeline for identifying defects on assembly lines using feature extraction and image processing techniques (e.g., RLE, image masking).
Implemented binary classification of images using deep learning CNN models (ResNet50) and applied image segmentation to classify defects.
Developed a Streamlit application to showcase the entire defect detection pipeline.
3D Graphical Overlay Tool:
Created a tool for plotting data points on 3D graphical images in a flight simulation domain.
Employed computer vision preprocessing techniques (line detection, axis detection) and utilized a plot digitizer to extract data points.
Developed an automated overlay tool to visually represent data points for better user understanding.
Technologies Used: Keras, TensorFlow, PyTorch, Python, Streamlit, OpenCV, Matplotlib, Seaborn.
• From Fulfilment-to-Fulfilment Centres, scheduled Ad hoc for the nominal trucks and cancelled the ones that are not required based on predictions and ensured on time delivery to customers.
• Using Tableau and SQL, worked on queries pulling middle mile logistical data.
• Oversaw client communications, managed record tracking and data communication activities.
• Led shipping and delivery quality control by managing downtime resulting in large increase in revenue.
• Oversaw every phase of supply chain, from purchase order to delivery to invoicing, targeting100% end-user satisfaction.
•Assisted with research and gained extensive knowledge of Pratham Education focusing on improving the quality of education for underprivileged children across India.
•Furthermore, built a project roadmap, analysed the ASER (Annual Status of Education Report) data over the last one year and analysed factors contributing to low quality in the education System.
•Finally discovered Private School enrolment catching up is huge. Though most children (age 5- 12) continue to get their education from Governments schools, private school enrolment is increasing despite high costs.
Tools used: Excel and Tableau
Technical Skills: