Summary
Overview
Work History
Education
Skills
Awards Recognition
Certification
Project Highlights
Core Skills
Websites
Timeline
Generic

Swasti Sahay

Noida

Summary

AI/ML Lead with 10 years of experience designing and delivering enterprise-grade Generative AI and Agentic systems. Currently leading architecture and development of multi-agent LLM platforms integrating LangGraph, LangChain, and hybrid Text-to-SQL/Text-to-NoSQL pipelines across MongoDB and PostgreSQL. Strong expertise in building scalable RAG systems, LLM guardrails, contextual memory frameworks (Redis), and Azure-based AI ecosystems for real-time enterprise intelligence and intelligent automation.

Overview

10
10
years of professional experience
1
1
Certification

Work History

Lead AI/ML

Aziro
12.2025 - Current
  • Architected and deployed KAI (Knowledge & Analytics Intelligence) Agent, a multi-agent maritime analytics assistant answering complex operational and financial queries.
  • Implemented hybrid Text-to-SQL and Text-to-NoSQL pipelines integrating MongoDB (vessel/voyage documents) and PostgreSQL (finance and analytics tables).
  • Developed SQL Registry and dynamic SQL generation with guardrails for secure query execution.
  • Integrated Redis for session memory and multi-turn contextual reasoning.
  • Designed LangGraph-based step-by-step orchestration framework for complex workflow decomposition.
  • Implemented schema-driven selective Mongo projections to reduce token usage and improve efficiency.
  • Containerized services using Podman for scalable deployment.

Senior AI Developer

Tata Consultancy Services
01.2021 - 12.2025
  • Developed AI architectures for worker safety on Microsoft Azure using VLM, LLM, and LVLM models to enhance safety protocols.
  • Designed and developed GenAI-powered Safety Tracker system (IP filed).
  • Developed scalable Azure-hosted AI ecosystems integrating Custom Vision, Azure AI Search, and RAG systems.
  • Built multimodal RAG chatbots for video and image analysis using embedding and VLMs to facilitate real-time data insights.
  • Designed RAG pipelines for chatbots and data extraction with Semantic Kernel and LangChain to improve response accuracy.
  • Improved detection accuracy from 73% to 92% through architecture optimization.
  • Developed OCR and LLM-based document intelligence solutions reducing manual verification effort by 80%.

Data Scientist

Tata Consultancy Services
03.2019 - 12.2020
  • Optimized models by 15% through advanced regularization and normalization techniques.
  • Executed data cleaning, visualization, and preprocessing to enhance ML pipeline performance.
  • Collaborated with stakeholders to ensure accurate implementation of data-driven solutions.

System Engineer

Tata Consultancy Services
Lucknow
01.2016 - 01.2019
  • Automated batch processes, significantly decreasing data processing time and improving overall efficiency.
  • Developed backend programs for insurance clients in RPGile, CLLE, and PL/SQL, enhancing system functionality.
  • Led resolution of priority issues and engaged in client discussions, driving enhancements to service delivery.

Education

M.Tech - Mechanical Engineering

Harcourt Butler Technological Institute
Kanpur, Uttar Pradesh
09.2015

Skills

  • Generative AI
  • RAG Pipelines
  • Prompt Engineering
  • Fine Tuning
  • Agentic Orchestration
  • Deep Learning
  • Machine Learning
  • Computer Vision
  • Image Processing
  • Feature Engineering
  • Data Visualization
  • Python
  • SQL
  • Azure Prompt Flow
  • Microsoft Azure
  • Azure AI Search
  • Azure AI Studio
  • Azure Open AI Studio
  • Custom Vision
  • OCR
  • Cloud
  • Redis
  • MongoDB
  • Postgres
  • DuckdB
  • Jupyter Notebook
  • OpenCV
  • VS Code
  • Podman
  • Docker
  • Effective communication
  • Stakeholder engagement

Awards Recognition

  • Certificate of Appreciation for Outstanding AI Performance in IoTDE and Connected Services.
  • Multiple Beyond Performance Awards and On-the-Spot Excellence Awards for innovative AI contributions.

Certification

  • Applied Data Science Bootcamp, Massachusetts Institute of Technology (MIT-ADSB), 2020.
  • Developing AI Applications using Python and TensorFlow, University of Oxford (Online), 2020.
  • IP Filed: Gen-AI-based Contextual Response Generation from Multimedia Streams.

Project Highlights

  • Multi-agent Maritime Analytics Assistant (KAI Agent), Built a multi-agent maritime analytics assistant that answers complex operational and financial queries by orchestrating MongoDB and Postgres in real time.
  • Agentic RAG Chatbot for Safety Compliance, Designed and implemented an Agentic AI-powered chatbot using LangChain, Azure AI Search, and MongoDB Atlas Vector Search, enabling context-aware, multimodal question answering from OSHA and ANSI safety documents.
  • Real-time Hazard Detection in Drone Imagery, Built a VLM-based AI pipeline using GPT-4o to detect safety hazards from drone images and video streams.
  • Gen AI Powered Safety Tracker, Worked on design and development of Gen AI powered Safety Tracker, an inventive safety monitoring system that cross-references visual data against company policies.
  • Document Intelligence for Safety Certificates, Developed an OCR and LLM-based solution using Azure Cognitive Services and Regex to automate text extraction, validation, and hazard risk categorization from scanned safety certificates.
  • Task Hazard Identification & Mitigation System, Created a RAG-powered hazard analysis engine where users input a task description and receive OSHA-compliant hazard identification and mitigation measures.
  • LLM Evaluation & MLOps with Azure Prompt Flow, Designed and implemented Prompt Flow pipelines for evaluating LLM outputs, improving reliability and accuracy of hazard detection responses.
  • AI Architecture for Workplace Safety Automation, Designed a scalable, Azure-hosted AI ecosystem integrating Custom Vision, Prompt Flow, and RAG-based reasoning for predictive hazard identification.

Core Skills

  • Generative AI & Large Language Models:

GPT-4o, Phi-3.5, Retrieval-Augmented Generation (RAG) Pipelines, Azure Prompt Flow, Vision Language Models (VLMs), Embedding Models, Prompt Engineering, Fine-Tuning (LoRA)

  • Agentic AI & Orchestration:

LangChain, LangGraph, Semantic Kernel, Redis

  • Deep Learning & Computer Vision:

Convolutional Neural Networks (CNNs), Object Detection (YOLOv5, YOLOv8), Segment Anything Model (SAM), Image Processing, OpenCV, TensorFlow

  • Machine Learning & Data Science:

Regression Models, K-Means Clustering, Principal Component Analysis (PCA), Feature Engineering, Model Optimization

  • Programming & Data Handling:

Python (Seaborn, Plotly, Matplotlib), SQL, Data Cleaning, Data Preprocessing, Data Annotation

  • Cloud Platforms & AI Services:

Microsoft Azure (App Service, AI Studio, Custom Vision, OCR, Azure AI Search, Azure OpenAI Studio, Azure Prompt Flow), Google Cloud Platform (GCP)

  • Databases & Vector Search:

MongoDB, PostgreSQL, DuckDB, MotherDuck, Vector Search (MongoDB Atlas)

  • DevOps & Development Tools:

Podman, Docker, Jupyter Notebook, VS Code

  • Architecture & Integration:

API Integration, Document Intelligence, AI Architecture Design, Proposal Writing

Timeline

Lead AI/ML

Aziro
12.2025 - Current

Senior AI Developer

Tata Consultancy Services
01.2021 - 12.2025

Data Scientist

Tata Consultancy Services
03.2019 - 12.2020

System Engineer

Tata Consultancy Services
01.2016 - 01.2019

M.Tech - Mechanical Engineering

Harcourt Butler Technological Institute
Swasti Sahay