Yash Tomar

Data Scientist
New Delhi

Summary

Data Scientist with 2+ years of experience in the Molecular Imaging Department at Siemens Healthineers and specialized expertise in Machine Learning, Artificial Intelligence, and Generative AI models, with a patent filing in process. Demonstrated ability to lead end-to-end solution design, from conceptualization through successful Proof of Concept (PoC) and project delivery, with a focus on business impact and operational efficiency. Proven track record in applying AI across multiple business functions. Skilled at breaking complex problems into manageable components for streamlined solutions. Committed to staying abreast of emerging technologies, exploring new frontiers in research, and contributing to innovation.

Overview

3 years of professional experience
6 Certifications
6 years of post-secondary education

Work History

Data Scientist

Siemens Healthineers
01.2022 - Current

Generative AI & NLP

- Automated the categorization of software-related defects using a fine-tuned large language model (LLM). Results were presented in quarterly dashboards to senior management, improving reporting accuracy and efficiency while reducing manual effort.

- Deployed an open-source code LLM on premises and ran experiments and PoCs on practical applications of code LLMs to proprietary data, contributing to decision-making on the creation of the Gen AI CoC at DC.

- Developed and integrated a Semantic Code Search functionality utilizing the Code LLM, an Embedding Model, and a VectorDB, enabling users to interact with their codebase in natural language, which helps new joiners and reduces their dependency on senior engineers.

- Instruction fine-tuned the code LLM on the proprietary codebase using PEFT LoRA and observed improved adherence to in-house coding guidelines and styles.

- Tech Stack: Python, PyTorch, Hugging Face, Transformers, PEFT LoRA, MilvusDB, FastAPI.


Data Management

- Developed a domain-specific Natural Language Search Engine using Knowledge Graph (KG) technology, overseeing architectural design, data pipelines, and KG schema.

- Engineered the Knowledge Graph as a Data Warehouse connecting to multiple relational and non-relational databases.

- Developed Entity Extraction, Subgraph Extraction, and Answer Visualization modules, facilitating natural language queries and data retrieval with visualizations.

- Implemented a Recommendation Engine leveraging the Knowledge Graph, delivering closely related results based on searched data.

- Tech Stack: Python, FastAPI, Blazegraph, SPARQL, AWS S3, SQL.


Education

Master of Technology - Data Analytics

National Institute of Technology, Trichy
Tamil Nadu
08.2020 - 07.2022

Bachelor of Technology - Computer Science

USICT
Delhi
08.2016 - 07.2020

Skills

Software Fundamentals

Accomplishments

2024 - Winner of DC SucCeSs Hackathon

Certification

Introduction to Generative AI

Work Availability

Monday, Tuesday, Wednesday, Thursday, Friday, Saturday, Sunday
Morning, Afternoon, Evening

Work Preference

Work Type

Full Time

Work Location

Remote, Hybrid

Important To Me

Work-life balance, Company Culture, Career advancement, Personal development programs, Healthcare benefits, Work from home option, Team Building / Company Retreats, Stock Options / Equity / Profit Sharing, Paid sick leave

Languages

English
Bilingual or Proficient (C2)
Hindi
Bilingual or Proficient (C2)

Interests

Cricket

Badminton

Long Drives

Quote

The way to get started is to quit talking and begin doing.
Walt Disney

Software

PyTorch

Spyder

Jupyter Notebook

Timeline

Exploring Technologies behind ChatGPT, GPT 4 and LLMs

03-2024

Transformer Models and BERT Models

12-2023

Encoder-Decoder Architectures

12-2023

Attention Mechanisms

12-2023

Introduction to Large Language Models

11-2023

Introduction to Generative AI

10-2023

Data Scientist

Siemens Healthineers
01.2022 - Current

Master of Technology - Data Analytics

National Institute of Technology, Trichy
08.2020 - 07.2022

Bachelor of Technology - Computer Science

USICT
08.2016 - 07.2020