Experienced Senior Data Scientist with 7 years of proven expertise in developing advanced AI solutions, now specializing in generative AI. Passionate about leveraging cutting-edge technologies to drive innovation and create impactful applications. Seeking to contribute deep technical knowledge and creative problem-solving skills to a forward-thinking organization aiming to push the boundaries of AI and machine learning. Looking to collaborate with diverse teams to design and deploy transformative AI models that deliver tangible value to users and stakeholders.
Knowledge Brain – Sensitive Information Extraction Using RAG from Contract Documents.
This project focuses on extracting sensitive information from contract documents using Retrieval-Augmented Generation (RAG). It is structured into two main components: ingestion and retrieval.
Contract documents are ingested from a Google Cloud Storage (GCS) bucket. The ingested data is stored in two databases:
AlloyDB: Stores structured document information for fast querying.
Spanner Graph Database: Used to extract entities and relationships to build an ontology that represents the document's semantic structure.
In this phase, RAG-based pipelines are used to extract answers to specific queries. The system leverages:
(i) LLM prompting to generate accurate responses.
(ii) Data from both AlloyDB and the Spanner graph to provide context-aware, semantically rich answers based on the ingested contract data.
Extract appropriate answers to a list of questions from an insurance proposal document using the RAG (Retrieval-Augmented Generation) method
Anonymization engine for data masking
Use Generative AI algorithm to get the proper feedback labels from given feedback text
Agent assist for marketing domain
MIA(Market Intelligence Assist)
Concierge for life science domain
Life science concierge aims to automate the categorization and extraction of specified data from health authority correspondence using a machine learning , deep learning and CRF approach. This will also enable the automated updating of a specific framework based on the correspondence.
Autosuggestion tool development for HR domain leave policy data
QA Virtual Assistant to troubleshoot HR domain leave policy data & label prediction of HR data
Virtual Assistant for telecommunication domain to handle customer quarries
Python , R , SQL
Statistical Analysis
Machine Learning , Deep Learning , NLP
Generative AI, Prompt Engineering, Spanner Graph
Google Dialogflow , RASA
Docker , Kubernetes, GCP
Problem Solving , Project Management
Introduction to Machine Learning in Production, Coursera
Introduction to Machine Learning in Production, Coursera