Summary
Overview
Work History
Education
Skills
Projects
Timeline
Generic

Swarna Mallik

Data Scientist
Bengaluru

Summary

Data Scientist with 4.5 years of experience in NLP and Deep Learning, and 2+ years in Generative AI. Key contributor to KnowRA+ - an award-winning, domain-customizable virtual assistant leveraging RAG, MiniLM, Qdrant, and more. Proven ability to deliver production-ready AI solutions that enhance service delivery. Skilled in Statistics, Machine Learning, Python, NLP, Hugging Face and Generative AI with a strong focus on continuous learning and innovation.

Overview

5
5
years of professional experience
25
25
years of post-secondary education

Work History

Deputy Manager

WNS Global Services
02.2021 - Current
  • Developed deep learning solutions using computer vision for image processing and denoising, enabling actionable insights, and enhanced decision-making.
  • Implemented OCR and NLP techniques to extract key metadata and patient information from unstructured healthcare documents, improving data usability and productivity.
  • Spearheaded the development of domain-specific GenAI chatbots for clients in travel and insurance, leveraging Retrieval-Augmented Generation (RAG) to reduce Average Handling Time (AHT), and improve First Call Resolution (FCR).
  • Played a core role in building KnowRA that won multiple awards at the 2024 Stevie International Business Awards.
  • Drove continuous innovation through R&D and integration of ML/AI techniques to enhance solution efficacy across domains.

Data Analyst (Intern)

SigmaWay LLC
06.2020 - 07.2020
  • Performed predictive analytics using ML techniques on the sales data of a company.

Data Analyst (Intern)

IIT Guwahati
05.2020 - 06.2020
  • Used classification models to generate insights from data.

Education

Certification - Applied Data Science using AI & ML

IIT Delhi
Delhi
04.2001 - 08.2024

Master of Science - Applied Economics

Presidency University
Kolkata, India
05.2018 - 06.2020

Skills

  • Statistics
  • Machine Learning
  • Deep Learning
  • Natural Language Processing (NLP)
  • Large Language Models (LLM)
  • Retrieval-Augmented Generation (RAG)
  • LangChain
  • LangGraph
  • Qdrant
  • Prompt engineering
  • Knowledge graph
  • Python
  • SQL
  • Tableau
  • Microsoft Azure

Projects

KnowRA+: Domain-Specific Virtual Assistant

Ongoing (2023–2024)
Led the development of KnowRA+, an award-winning domain-adaptive virtual assistant built using Retrieval-Augmented Generation (RAG). Designed to handle domain-specific queries across travel, insurance, and more, it integrated MiniLM for semantic search and Qdrant for efficient vector storage, enabling scalable and context-aware responses.
Tech Stack: RAG, MiniLM, Qdrant, LangChain, Generative AI


Image processing and metadata extraction 

Aug 2021
Implemented OCR pipelines for processing highly unstructured healthcare documents, enabling accurate extraction of
critical information from scanned records and handwritten text. Also applied image preprocessing and deep learning techniques to
improve data quality and downstream NLP performance in clinical and administrative workflows.
Tech stack: OCR, Computer vision, OpenCV, NLP techniques


Data Anonymisation 

Aug 2024
Developed an advanced anonymization system utilizing Named Entity Recognition (NER) to process unstructured legal
text documents. The system effectively transforms PII and other sensitive data to minimize the risk of re-identification,
ensuring strict compliance with data privacy regulations and significantly enhancing information security.
Tech stack: ML, NLP, faker etc.


Lumbar Spine Degenerative Classification 

Aug 2024
Built a comprehensive classification system for lumbar spine degeneration using Magnetic Resonance Imaging (MRI)
scans, aimed at detecting and enhancing diagnostic accuracy and clinical decision-making.
Tech stack: Various neural network models like EfficientNet, InceptionV3, ResNet etc.


Timeline

Deputy Manager

WNS Global Services
02.2021 - Current

Data Analyst (Intern)

SigmaWay LLC
06.2020 - 07.2020

Data Analyst (Intern)

IIT Guwahati
05.2020 - 06.2020

Master of Science - Applied Economics

Presidency University
05.2018 - 06.2020

Certification - Applied Data Science using AI & ML

IIT Delhi
04.2001 - 08.2024
Swarna MallikData Scientist