Sanjana Sahayaraj

Senior Data Scientist Lead
Bangalore, KA
The price of inaction is far greater than the cost of a mistake
Meg Whitman

Summary

Enthusiastic Data Scientist, Researcher and Engineer eager to contribute to team success through collaboration, planning, hard work and continuous learning. Promoted to a senior lead role within 2 years of completing a Master's degree. Motivated to learn and grow with a team while solving everyday human and business problems by applying science and thought to data.

Overview

4 certifications
6 years of post-secondary education
4 years of professional experience

Work History

Senior Data Scientist Lead

IBM
Bangalore, KA
08.2020 - Current
  • Worked remotely with a team distributed across India and the US to deliver 4 infrastructure management and AIOps projects successfully within 6 months
  • Led 2 Data Scientist teammates across projects to realize joint cost-saving deliveries that provided additional value to customers
  • Mentored 1 Data Engineer teammate interested in learning Data Science, covering EDA and model development
  • Led patent brainstorming and drafting sessions with 3 first-time patentees, on topics of ModelOps, AutoAI and Infrastructure Investment Recommendation Systems

Research Software Engineer

IBM Research
San Jose, CA
01.2019 - 07.2020
  • Co-developed a state-of-the-art, domain-adaptive semantic role labeling component for a generic NLP library, with 4 teammates
  • Co-developed a multilingual negation detection system to differentiate between disease presence and absence in English and French EMRs, with one teammate
  • Co-developed and co-published a novel word embedding technique to embed ordering knowledge from ontologies, with one teammate
  • Developed an NLP system to build a polymer knowledge base from publications and patents on polymer discovery and synthesis

Research Assistant

UCSB
Santa Barbara, CA
09.2016 - 12.2018
  • Developed a hybrid embedding technique combining word-level and character-level representations to help build a medical question answering system
  • Developed a regression model to predict hospital length of stay for coagulopathy patients
  • Developed a clustering algorithm to identify groups of miRNA that play different roles in protein synthesis
  • Taught Python, C++ and Unix basics to undergraduate students

Graduate Research Intern

IBM Research
San Jose, CA
06.2018 - 09.2018
  • Researched CoNLL shared tasks and current techniques, examining how current state-of-the-art approaches are not domain adaptive
  • Led the development and release of two domain-specific PropBank datasets by working with subject matter experts in the finance and compliance domains

Education

Master of Science - Computer Science

University of California, Santa Barbara, Santa Barbara, California, United States
09.2016 - 12.2018
  • Course work in Advanced Distributed Systems, Application Backend Architecture, Deep Learning in Practice, Matrix Operations, Theoretical Foundations of Machine Learning and Automated Verification
  • Worked as a Teaching Assistant and Research Assistant, developing presentation, planning and research skills

Bachelor of Science - Computer Science

Anna University, Chennai, Tamil Nadu, India
08.2012 - 04.2016

Course work covering: Assembly Programming, Basic Programming with C, Differentiation and Integration, Object Oriented Programming with C++ and Java, Networking Basics, Digital Signal Processing, Digital Communication Systems, Microprocessors and Microcontrollers, Discrete Mathematics, Data Structures, Design and Analysis of Algorithms, Theory of Computation, Grid Computing, Information Security, Operating Systems, Database Management System, Principles of Compiler Design, Numerical Methods, Advanced Computer Architecture and Artificial Intelligence.

Skills

Data manipulation

Certification

Deep Learning: Advanced NLP and RNNs

Publications and Patents

  • Order Embeddings from Merged Ontologies using Sketching: https://arxiv.org/abs/2101.02158
  • P201906891US01: Bias Identification and Correction in Text Documents
  • P20190280US01: Domain Adaptive Semantic Role Labeling
  • P202005049US01: Embodying order relations by jointly utilizing domain-specific ontologies to embed terms
  • P202007865PK01: Real Time Identification of Change-Induced Incidents in AIOps Applications

Additional Information

  • Hobbies include photography, reading and trying new recipes.
  • Sports include Badminton, Muay Thai and Indoor archery.
  • Can read, write and speak Tamil and English very well. Have a TOEFL score of 116/120.
  • What my co-workers have to say about me can be found in the Recommendations section of my LinkedIn profile at https://www.linkedin.com/in/sanjana-sahayaraj/.