Surya Pradeep Kumar Varma

Hyderabad

Summary

  • Seasoned Data Scientist and Generative AI Lead with a proven track record of delivering AI-driven solutions across startup, scale-up, and enterprise-level companies.
  • Recognized with multiple excellence awards across organizations for applying core expertise in ML, NLP, large language models (LLMs), and deep learning to solve complex business challenges and drive measurable impact.
  • Alumnus of IIT BHU (B.Tech, Computer Science) and Deakin University (Master's, Data Science).
  • Currently leading Gen-AI initiatives at Space Inventive for Novartis, delivering projects from ideation through development to deployment.

Accomplishments

  • Successfully delivered high-impact Gen AI use-case presentations to 15+ C-suite healthcare executives at the US GDS Healthcare Conference, demonstrating technical expertise and strategic vision that directly resulted in positive follow-up dialogues, strategic partnership discussions, and proof-of-concept agreements with tier-1 US healthcare organizations including Hackensack Meridian Health and Cano Health.
  • Received the Star of the Quarter Award and the Excellence Award at Curl Analytics.
  • Awarded Spot Award at Space Inventive: Recognized for accelerating Gen AI development and championing cross-functional collaboration on technical architecture, showcasing innovation and leadership.

Work History

Senior Data Scientist

Space Multimedia
11.2023 - Current

Novartis | Gen AI Solutions, Innovation & Scalable Delivery
* Orchestrated the development of production-grade Gen AI solutions, aligning technical teams and business stakeholders to deliver high-impact initiatives, driving wide adoption.
* Custom Content Generation Framework: Engineered a RAG pipeline (LlamaIndex) that reduced retrieval latency by 35% through holistic context integration, earning stakeholder endorsement for a PoC now in active development.
* Accelerated innovation by building 10+ scalable prototypes (e.g., Virtual Presenter Bot, Deep Research Agent) with modular architectures, reducing validation timelines by 40% and securing executive buy-in for R&D expansion.

Client Pitches & Demos
* Led creation of tailored Gen AI use-case decks for executives at the GDS Healthcare Conference, collaborating with data science, marketing, and design teams.
* Delivered technical pitches to 15+ US healthcare leaders, sparking partnership dialogues with Hackensack Meridian, Cano Health, and others.

Interviews & Hiring Processes
* Revamped the Data Science hiring process, reducing mis-hire risk and time-to-hire through comprehensive candidate scoring and cross-functional interview panels. Interviewed numerous candidates for the Data Science and Generative AI team.

Awards & Recognition
* Spot Award (Space Inventive): Recognized for accelerating Gen AI PoC development and championing cross-functional collaboration on technical architecture, showcasing innovation and leadership.

Data Scientist II

Curl Analytics
Bangalore
04.2021 - 11.2023
  • Led key projects at Curl Hg across multiple responsibilities, driving the development of, and later taking full ownership of, multiple critical ML components for Sara Document AI (a document processing and information extraction engine): NER models (SPV, ANA, Banking NER), an address extraction model, Topology (a document layout parser), and a custom post-processing engine, with 95% test coverage across all modules.
  • Fine-tuned the BLOOM (BigScience Large Open-science Open-access Multilingual Language Model) LLM as a question-answering model.
  • Built and scaled a custom NER engine that forms the product's NER backbone, extracting 19 different entities from unstructured document data in multiple formats.
  • The model is an ensemble of a custom-built transformer-based NER model, a custom-built dynamic search algorithm, and pattern-based heuristics, followed by NER entity conflict resolution to predict the custom NER entities.
  • Profiled and optimized the end-to-end pipeline in Sara-Backend, attaining a mean latency improvement of 35% per page processed.

Junior Data Scientist

Curl Analytics
Bangalore
08.2020 - 04.2021
  • Researched multiple NER models, including Flair, BERT, and spaCy.
  • Developed a Bi-LSTM-based ANA (anchor/keyword recognition) NER model, achieving an improved recall of 93%.
  • Integrated the ANA NER model into Topology classification, making it run 3x faster.

Remote Intern

TCS iON
05.2020 - 06.2020
  • Built a sentiment classifier for IMDb movie reviews using traditional and deep learning techniques with TensorFlow, Keras, PyTorch, and BERT.

Internship

NI Innovation Center
Noida
04.2020 - 05.2020
  • Company Overview: ITS Engg. College, Noida
  • Developed three projects (a CNN-based digit classifier, home automation, and an Arduino simulation) using the SimulIDE, NI LabVIEW, and ThingSpeak IoT platforms.

Internship Trainee

Ministry of Electronics and Information Technology (MeitY)
Delhi
05.2019 - 07.2019
  • Company Overview: Govt. Of India
  • Worked as a Trainee on the MeitY Startup Hub (MSH) project, performing exploratory data analysis on data from diverse startups.

Education

Master's in Data Science

Deakin University
09.2023

Post Graduate Degree in Artificial Intelligence and Machine Learning

The University of Texas at Austin
01.2021

Bachelor's in Computer Science and Engineering

Indian Institute of Technology (IIT BHU)
01.2020

Overview

6 years of professional experience
5 Certifications

Skills

  • Python3
  • R
  • SQL
  • C
  • Bash
  • PyTorch
  • OpenCV
  • TensorFlow
  • Keras
  • Scikit-learn
  • NumPy
  • Pandas
  • Dask
  • SciPy
  • Plotly
  • LLM
  • spaCy
  • NLTK
  • Hugging Face
  • LAVIS
  • Promptify
  • LangChain
  • Azure
  • Azure ML
  • Tableau
  • MATLAB
  • Redis
  • MongoDB
  • Git
  • Jupyter
  • LabVIEW
  • Linux/UNIX
  • OpenAI
  • Generative AI

Certification

  • HarvardX Data Science Professional | Online Certified Program by Harvard University
  • Deep Learning Specialization | DeepLearning.AI
  • Machine Learning | Prof. Andrew Ng, Stanford University
  • Master SQL for Data Scientists

Languages

English
Bilingual or Proficient (C2)
Hindi
Bilingual or Proficient (C2)
Telugu
Bilingual or Proficient (C2)
French
Beginner (A1)
Bengali
Elementary (A2)
