Data Scientist with 2+ years of experience at Siemens Healthineers in the Molecular Imaging Department, specialized expertise in Machine Learning, Artificial Intelligence, and Generative AI models, with a patent filing in process. Demonstrated ability to drive end-to-end solution design, from conceptualization through successful Proof of Concept (PoC) and project delivery, with a focus on driving business impact and operational efficiency. Proven track record in applying AI across multiple business functions. Skilled at simplifying complex problems into manageable components for streamlined solution. Committed to staying abreast of emerging technologies, exploring new frontiers in research, and contributing to innovation.
Generative AI &NLP
- Automated the categorization of software-related defects using a fine-tuned large language model (LLM), presented in quarterly dashboards to higher management, significantly improving reporting accuracy and efficiency while reducing manual effort.
- Deployed an Open Source Code LLM on premise, performed various experiments and POCs on the practical applications of Code LLMs on Proprietary Data, contributing to the decision making of Gen AI CoC creations at DC.
- Developed and integrated a Semantic Code Search functionality utilizing the Code LLM, Embedding Model, and a VectorDB, enabling users to interact with their codebase in natural language. Assisting the new joiners and reducing their dependency on senior engineers.
- Instruction finetuned the code LLM on proprietary codebase using PEFT LoRA. Observed improved adherence to in-house coding guidelines and styles.
- Tech Stack: Python, Pytorch, HUggingFace, Transformers, PEFT LoRA, MilvusDB, FastAPI.
Data Management
- Developed a domain-specific Natural Language Search Engine using Knowledge Graph (KG) technology, overseeing architectural design, data pipelines, and KG schema.
- Engineered the Knowledge Graph as a Data Warehouse connecting to multiple relational and non-relational databases.
- Developed Entity Extraction, Sub graph Extraction, and Answer Visualizations modules, facilitating natural language queries and data retrieval with visualizations.
- Implemented a Recommendation Engine leveraging the Knowledge Graph, delivering closely related results based on searched data.
- Tech Stack: Python, FastAPI, Blazegraph, SPARQL, AWS S3, SQL.
Software Fundamentals
undefined2024 - Winner of DC SucCeSs Hackathon
Introduction to Generative AI
Cricket
Badminton
Long Drives
Pytorch
Spyder
Jupyter notebook
Exploring Technologies behind ChatGPT, GPT 4 and LLMs
Transformer Models and BERT Models
Encoder- Decoder Architectures
Attention Mechanisms
Introduction to Large Language Models
Introduction to Generative AI