Summary
Overview
Work History
Education
Skills
Accomplishments
Timeline
Generic

Kaleeswaran M

Bangalore

Summary

Sr. Software Engineer in AI/ML with 7 years of experience primarily in Medical Technology and Media and Entertainment domain, solving problems in Computer Vision, Video processing, Audio processing, Multi-modal models, Generative models, model optimization, etc. Working experience in Google Cloud Platform and Microsoft Azure. Apart from technical expertise have performed in techno-functional role leading a small sized team.

Overview

7
7
years of professional experience

Work History

Senior Sofware Engineer - AI/ML

Stryker
Bangalore
11.2021 - Current

Working with Endoscopy R&D division of Stryker to build efficient computer vision algorithms to be deployed in Stryker's endoscopic devices.

Surgical Phase Estimation.

  • Developed a Vision Transformer based model to estimate the surgical phase at a given point of time and achieved near state of art accuracy of 88%.
  • Developed a GAN based data standardization model to improve generalizability by 8%. Have filed the patent - 'Style transfer of Surgery videos for Custom Visualization'.
  • Implemented a modified Grad-CAM for explainability of the model.

Surgical Foundation model

  • Trained a large scale self-supervised model based on VideoMAE on surgical videos.
  • Validating the foundational model generalizability on Image quality assessment task.

Surgical anatomy segmentation

  • Developed a surgical anatomy segmentation model with super-resolution supervision.
  • Optimized the model to run at 13 FPS at 1280x720p on the Jetson Xavier.

Surgical Automatic Speech Recognition

  • Finetuned Whisper ASR model on surgical vocabulary on a TTS generated dataset.
  • Improved the finetuning performance with LoRA based finetuning.

Senior Machine Learning Engineer

Quantiphi Analytics
Mumbai
05.2018 - 11.2021

Worked mostly with AthenasOwl team in solving AI/ML problems in Media and Entertainment domain. Work involved solving problems mainly in Video processing and audio processing for Media companies.

Audio Keyword Detection

  • Built a CNN-LSTM based architecture with CTC loss to detect particular keywords in Football commentary track for Bundesliga.

Audio-Visual Speech separation

  • Built a model based on ‘Looking to Listen: Audio-Visual speech separation’, to separate audio waveforms of speech from rest in a mixed audio waveform and assign the separated speech waveforms to the individual speakers in the video.

Scene Boundary Detection

  • Conceptualized and implemented a model end-to-end from data to network to detect meaningful boundaries of scenes in a media content.

Auto-highlight clip classification

  • Build a model leveraging Visual and Audio modality to classify the event happening in a football clip.
  • Was able to validate model's improved performance by fusing visual and audio representations rather than having the model trained only through the visual representations.

Content Recommendation

  • Built an explainable Recommendation system on the MovieLens 20M dataset based on ‘Knowledge Graph Convolution Network for Recommendation system’, which uses knowledge graphs as auxiliary information.

Education

Bachelor of Technology - Metallurgical And Materials Engineering

Indian Institute of Technology Roorkee
Roorkee
05-2018

Skills

  • Video processing
  • Audio processing
  • Computer vision
  • Speech recognition
  • Model optimization
  • Cloud - Azure, GCP
  • Distributed computing - PySpark
  • Algorithm development

Accomplishments

  • Filed the patent - 'Style Transfer Of Surgery Videos For Custom Visualization'.

Timeline

Senior Sofware Engineer - AI/ML

Stryker
11.2021 - Current

Senior Machine Learning Engineer

Quantiphi Analytics
05.2018 - 11.2021

Bachelor of Technology - Metallurgical And Materials Engineering

Indian Institute of Technology Roorkee
Kaleeswaran M