Ashish Som

Bengaluru

Summary

Associate Consultant at Bridge Medical Consulting, adept at engineering scalable AI pipelines and optimizing data workflows. Proven expertise in Python and problem-solving, delivering actionable insights through AI-driven data analysis and collaboration with cross-functional teams. Enhanced document evaluation processes to deliver high-quality outputs aligned with clinical research standards.

Currently exploring advanced generative AI concepts, including LLM fine-tuning, retrieval-augmented generation (RAG), and agentic AI systems.

Overview

7 years of professional experience

Work History

Associate Consultant (AI)

Bridge Medical Consulting Services Pvt. Ltd.
07.2024 - Current
  • Developing multiple production-grade AI pipelines for healthcare-related text processing tasks.
  • Creating reusable, config-driven scripts to clean, de-duplicate, and format title-abstract data from master Excel files.
  • Engineering pipelines with custom PDF readers, dynamic prompt builders, and result-savers for document evaluation using LLMs.
  • Building modular systems to summarize long documents using OpenAI APIs, with prompt tuning and batch processing capabilities.
  • Optimizing full-text screening and extraction workflows to process PDFs, extract structured insights, and export results in Excel format.
  • Designing main.bat launchers to start YAML-based configurable pipelines, ensuring high portability and scalability.
  • Managing testing cycles, prompt refinement, and result validation to maintain data quality and ensure business alignment.
  • Collaborating with medical research teams working on clinical systematic literature reviews (SLRs) to validate and align AI outputs with domain-specific expectations.

Data Scientist

Learnvista Pvt. Ltd.
Bengaluru
08.2021 - Current
  • Leveraging Python for data preprocessing, feature engineering, and building predictive models.
  • Applying statistical techniques to extract insights from large datasets.
  • Developing and fine-tuning regression, classification, and time series models.
  • Ensuring robust model evaluation and optimization.
  • Collaborating with cross-functional teams to translate data-driven insights into actionable recommendations.
  • Communicating findings effectively to stakeholders.

Data Analyst

Zhengzhou Intel Translation Company
Zhengzhou
06.2018 - 01.2020
  • Created Excel workbooks to pull metrics data and present it to stakeholders, concisely explaining where resources were best allocated.
  • Collaborated with business-unit leaders to identify and prioritize problems.
  • Used statistical methods to analyze data and generate useful business reports.
  • Utilized data visualization tools to effectively communicate business insights.

Education

Certificate in Data Science - AI and ML

IBM Online
Bengaluru
05.2021

MBA - Marketing and HR

Uttaranchal Institute of Management
Dehradun
01.2011

BCA

CCS University Meerut
Meerut
01.2009

Skills

  • Programming and Scripting: Python, OOP, YAML, Batch Scripting
  • Libraries and Tools: NumPy, Pandas, OpenAI API, Matplotlib, Seaborn
  • Data Handling: Excel, CSV, PDF Parsing
  • AI and LLM: Prompt Engineering, API Integration, Result Parsing
  • Automation: Config-driven Execution, Modular Pipeline Design
  • Version Control: Git, GitHub
  • Soft Skills: Problem Solving, Communication, Cross-functional Collaboration

Projects

1. TIAB Screening Automation System (LLM-Powered)

Tools & Tech: Python, OpenAI API, Pandas, PyPDF2, YAML, Excel, Batch Scripts

Description: Designed and deployed an end-to-end automation pipeline to conduct TIAB (Title and Abstract) screening using LLMs. The system screens large volumes of biomedical abstracts and classifies them based on inclusion/exclusion criteria.

  • Built a reusable, config-driven engine that enables dynamic execution by modifying only the YAML config files (file paths, flags, model settings).
  • Developed a custom PDF Reader module to map each document to its corresponding metadata entry.
  • Engineered a Prompt Builder that reads screening logic from Excel and dynamically generates structured prompts for the LLM.
  • Integrated OpenAI APIs and implemented a response parser to extract and format model decisions into a structured Excel file.
  • Added logging, testing hooks, and validation mechanisms for model confidence and quality control.

Outcome: Cut manual screening effort by 70%, reduced processing errors, and scaled seamlessly to large datasets.

2. Full-Text Screening and Extraction Pipeline (Advanced AI Automation)

Tools & Tech: Python, PyPDF2, OpenAI API, Pandas, YAML, Excel

Description: Created a scalable AI pipeline to screen full-text research papers for complex criteria (e.g., clinical relevance, population criteria).

  • Parsed full-text PDFs and mapped them to custom IDs for consistent processing.
  • Engineered a dynamic prompt builder to generate structured prompts aligned with defined schemas for LLM-based screening and data extraction.
  • Incorporated prompt engineering and iterative prompt refinement workflows to optimize output quality across diverse research contexts.
  • Enabled project-level customizations for prompt formats, schema variations, and rule-based inclusions.

Outcome: Significantly enhanced the scalability and accuracy of AI-assisted literature reviews, reducing manual workload and improving the consistency of evidence synthesis.

3. LLM-based Text Summarization Pipeline

Tools & Tech: Python, OpenAI API, PyMuPDF, Pandas, YAML

Description: Built a modular pipeline to automatically summarize full-text PDFs in the medical and scientific domain using LLMs.

  • Ingested documents in bulk and converted them into clean text blocks.
  • Engineered flexible, section-wise summarization prompts (e.g., Introduction, Methods, Conclusion) to increase relevance.
  • Included a test run module for prompt tuning and iterative improvement of summary length, coherence, and factuality.

Final summaries were exported to Excel for business user validation.
