
PARAG HEMANT SONAR

Generative AI Engineer
Pune, MH

Summary

Data Scientist with 5+ years of experience in data scraping, data crawling, predictive modeling, and cleaning and organizing data for business use. Interested in projects that offer learning and understanding in the fields of artificial intelligence, machine learning, deep learning, natural language processing, time series analysis, sentiment analysis, big data analytics, and data visualisation.

Overview

7 years of professional experience
6 Certifications
3 Languages

Work History

Senior Software Development Engineer - AI & ML

Optimum Solutions
06.2024 - 06.2025
  • Designed and Deployed Retrieval Augmented
  • Customer segmentation: Used transaction-level and demographic data to build custom customer profiles. Applied an ensemble of K-means and business rules in Python to build the segments.
  • Fraud detection analytics: Identified fraudulent transaction metrics by building a fraud detection model with classification models. Built an ANN-based model to predict declined transactions.


  • Project: LLM-based intelligent fraud detection system.
  • Description: Built a fraud detection pipeline integrating anomaly detection with LLM-powered explainability using RAG. Enabled simulation of new fraud patterns and real-time alerts with interpretable justifications.
  • Tools: Python, GPT-4, FAISS, PySpark, LangChain, Streamlit, MongoDB

Data Scientist – Product Development

Grihum Housing Finance
03.2023 - 07.2023
  • Company Overview: https://grihumhousing.com/
  • Identified valuable data sources and automated operational processes using data analytics and machine learning techniques.
  • Used SQL and Power Query to build an end-to-end product sales data pipeline, supporting analysis of large volumes of banking product information in Python and R to discover trends and patterns.
  • Built predictive models and machine learning algorithms.
  • Created data crawling tools in Python and R and data visualizations in Power BI, and presented them to the Technology and Production teams.

Analytics Manager – Data Science

PNG Jewellers
08.2022 - 04.2023
  • Company Overview: https://www.pngjewellers.com/
  • Implemented automated data reduction and evaluation with MapReduce and Hadoop, cutting the process from eight (8) weeks to eight (8) hours.
  • Created a Power BI dashboard to bridge the gap between technology and the retail business, delivering data-driven recommendations and reports to executives and directors.
  • Developed and deployed advanced statistical models, predictive models, and learning methods (random forest, decision tree, SVM, NLP, gradient boosting, supervised and unsupervised learning, clustering, classification, and regression modelling).

Data Analyst – Product Development

Reality Premedia Services
11.2021 - 06.2022
  • Company Overview: https://www.realitypremedia.com/
  • Developed and maintained an end-to-end database and data system using SQL queries, in a reorganised and readable format for clients such as PwC and Deloitte.
  • Used cluster analysis and sentiment analysis on online reviews for quality checks of products and services.
  • Used statistical and programming tools including SPSS, Stata, R, Excel, SQL, Python, Power BI, and Hadoop.
  • Created investment models and trading algorithms using Apache Hadoop.
  • Worked on NLP problems such as sentiment analysis, named entity recognition, and key phrase extraction (text analytics), as well as money laundering, nonparametric statistical models, RNNs, and chatbots.
  • Segmentation: Used transaction-level and demographic data to build custom customer profiles. Applied an ensemble of K-means and business rules in Python to build the segments. Teams use these segments to understand shifts in household behaviour over time and to better target marketing and advertising of products to the correct consumer profile.

Data Analyst

Innoplexus
03.2019 - 05.2021
  • Company Overview: https://www.innoplexus.com/
  • Saved around 70% of resource time in planning and execution.
  • Saved around 80% of resource time in pharma-domain data forecasting using data visualization tools such as Power BI, MS Excel, Python, R, and Hadoop.
  • Worked on designing an advanced neural-network-based model that predicts the probability of success of clinical trials with a precision of 85%.
  • Increased data scraping capabilities and data quality by 4x.
  • Raised code quality through data reviews, helping reduce development problems by 60%.
  • Developed an internal web-based solution database, bringing team resolution rates from 60% to 90% in 3 months.
  • Created Power BI dashboards for clients such as Commerzbank Group, Pfizer, and Ranbaxy. Used Python and SQL for time series analysis and cluster analysis.
  • Build and Maintain Database Pipeline of Heart Rate Data and SQL Queries to track the operational Productivity. It hel Patient Health Record for Sensor Detection Model.

Associate Data Analyst

Innoplexus
07.2018 - 03.2019
  • Company Overview: https://www.innoplexus.com/
  • Helped clients make decisions based on facts and data-driven solutions using AI/ML. Services offered included drug discovery, clinical trial predictions, biomarkers, and sentiment analysis.
  • Initiated the implementation of Scrum methodology, resulting in a 40% increase in quality and productivity.
  • Performed pharma clinical trials data collection and a variety of statistical analyses using R, SQL, Python, Tableau, and Power BI.
  • Developed and delivered customized solutions for the biggest banks and finance companies in more than 6 countries.
  • Worked closely with a team of data engineers and data scientists to improve the efficiency of the Clinical Trials Prediction Engine by 67%.
  • Fixed 30+ bugs on a daily basis.
  • Used Microsoft Azure Health Data Services and Azure Databricks with patient heart rate data for sensor detection in product development.

Technical Apprentice

Innoplexus
02.2018 - 07.2018
  • Company Overview: https://www.innoplexus.com/
  • Helped clients make decisions based on facts and data-driven solutions using AI/ML. Services offered included drug discovery, clinical trial predictions, biomarkers, and sentiment analysis.
  • Performed data crawling and coding for 2 projects.
  • Maintained a 99.5% quality level during the internship period.
  • Developed and maintained 30+ financial models for financial institutions.

Education

Master of Science - Economics

Gokhale Institute of Politics and Economics
01.2017

Bachelor of Commerce - Costing

Symbiosis International University
01.2015

Skills

Database Management: SQL, Azure Cosmos DB, NoSQL

Certification

Financial Engineering and Risk Management Part 1, Columbia University via Coursera, 2020

Accomplishments

  • Data Scraping Framework
  • Developed a stock market data scraping model. Developed a web crawler with the team.

MY TIME

  • A: Design and Maintain Data System
  • B: Data Modelling, Data Pipeline Flow etc.
  • C: Fixes - Bugs/Coding issues in Real Time
  • D: Business/Internal Requirements
  • E: Design and Development of Statistical and Predictive Models
  • F: Team Management

INDUSTRY EXPERTISE

  • LEADERSHIP
  • PROBLEM SOLVING
  • SELF CONFIDENCE
  • PLANNING
  • TIME MANAGEMENT
  • NETWORKING

NLP & LLM Models

LLaMA, BLOOM, LaMDA, BERT, PaLM, GPT, DALL-E

AIML and Big Data

AWS SageMaker, Google Colab, Azure Cognitive Services, Jupyter Notebook, Hadoop, PySpark, Hive, AWS EMR

Data Science Toolkit

TensorFlow, Keras, PyTorch, pandas, Microsoft CNTK, NumPy

Software

Gen AI
