Summary
Overview
Work History
Education
Skills
Certification
Accomplishments
Publications
Projects
Timeline
Generic
Harish Natarajan

Harish Natarajan

Summary

Sophisticated Machine Learning Engineer with a specialization in Generative AI, machine learning, and computer vision. Proficient in developing and deploying ML and CV web apps on Azure, Heroku, GCP, and AWS using Docker for seamless scalability. Demonstrated expertise in architecting efficient solutions that leverage cutting-edge technologies to meet diverse business requirements.

Overview

2
2
years of professional experience
4
4
years of post-secondary education
3
3
Certifications

Work History

Computer Vision Engineer

Aftershoot
04.2022 - Current
  • Contributed to the development of a sophisticated Retrieval Augmental Generation Pipeline and optimized Large Language Model (LLM) models using cutting-edge techniques like Parameter Efficient Finetuning (PeFT), Low Rank Adaptation (LoRA), and QLoRA. These improvements were tailored to efficiently handle bespoke inquiries within the Aftershoot Ecosystem, resulting in a notable 40% reduction in workload for the customer success team.
  • Engineered an internal image generation tool leveraging stable diffusion for model training purposes, subsequently deploying it on AWS Sagemaker for seamless integration. Additionally, conducted research and devised logic for monitoring Model Drift, facilitating timely model retraining in response to data drift occurrences.
  • Developed a deep learning Vision Transformer model for closed eyes and emotion detection, achieving 90% accuracy and capable of detecting individuals wearing sunglasses.
  • Researched and implemented machine learning solutions to address business challenges affecting 10,000 photographers.
  • Automated data preprocessing pipelines and model training workflows with Python scripting, reducing manual efforts by 50%.
  • Deployed diverse computer vision models including face detection, pose estimation, and eye gaze detection within the Aftershoot ecosystem.
  • Engaged in the development of a diverse range of deep learning models, including speech fluency detection, speech-to-text, and various others. Quantified these models into CoreML, OpenVINO, and ONNX formats, and subsequently deployed them within a Kubernetes cluster.
  • Streamlined data preprocessing and model training pipelines by automating routine tasks with Terraform Python scripting, reducing time spent by 50% on manual processes significantly.
  • Optimized cloud storage costs by 3x and established deterministic training pipelines for reproducibility.

Machine learning and DevOps Internship

Tekolutions.ai
01.2022 - 04.2022
  • Migrated Application and Database infrastructure from Azure to DigitalOcean for enhanced performance and scalability.
  • Developed TableCellNet, an state-of-the-art (SOTA) model designed to detect and segment tables and their cells, enabling better text extraction for OCR models. Deployed the service on Azure using Flask API for seamless integration.

Education

Bachelors of Engineering - Information Technology

Mahatma Gandhi Mission's College of Engineering And Technology
Kamothe, Navi Mumbai
08.2018 - 05.2022

Skills

  • Web Frameworks (Flask, Streamlit, Django)
  • undefined

    Certification

    AI For Medicine Specialization

    Accomplishments

    • Runner up Smart India Hackathon 2020 (Grand Finale)
    • Published AudioFeaturizer Python Package
    • Participated in Govt-Techthon 2020 (IEEE Computer Society)
    • Created Pyspark with opencv Docker image
    • Winner Manthan National Security Hackathon 2021 (Grand Finale)

    Publications

    • Career Advisor Using AI Paper Published on Springer
    • Published TNet-Segmentation Python Package for medical image segmentation

    Projects

    Face Recognition at Varied Angles in Live CCTV (MANTHAN NATIONAL SECURITY HACKATHON 2021)

    • Currently, face recognition works well when a face is directly visible on the camera
    • We have developed a system that would analyze live CCTV footage to detect and identify suspects
    • The algorithm is capable of recognizing suspects even if we have a single reference image
    • The reference image could be a sketch or it could be an old image of the suspect
    • To maintain output FPS, parallel processing of video is done
    • Once the suspect is recognized, live notification regarding the frame number, camera details, location of the camera, and the name of the suspect is sent to the admin via mail and also displayed in the dashboard
    • Admin is provided with a dashboard in which he or she can add, update or remove the suspect images
    • Also, Admin can upload recorded videos to perform analysis
    • The algorithm is capable of recognizing a suspect even if he/she wears a mask


    Soil Fertility Prediction and Recommendation With EDA

    • We all face this problem wherein government asks us to upload the same document in multiple places
    • Doc Uploader is a solution to that problem
    • Here user only needs to upload the document once. Then OCR is used to get the text out of the document
    • To verify that the details are correct, we would display and ask for correction
    • If the detail is correct, it gets stored in the database from where all government sites can process the information
    • Currently, the app supports PAN card and Aadhar Card
    • The OCR works for multiple Indian Language
    • We've also added regex since the details in documents differ


    Doc Uploader an OCR based scanner (IEEE GOV-TECHTHON 2020)

    • We all face this problem wherein government asks us to upload the same document in multiple places
    • Doc Uploader is a solution to that problem
    • Here user only needs to upload the document once. Then OCR is used to get the text out of the document
    • To verify that the details are correct, we would display and ask for correction
    • If the detail is correct, it gets stored in the database from where all government sites can process the information
    • Currently, the app supports PAN card and Aadhar Card
    • The OCR works for multiple Indian Language
    • We've also added regex since the details in documents differ


    Job Prediction for Disabled People (SMART INDIA HACKATHON 2020)

    • Disabled people find it too difficult for getting a job so we plan to develop a web app that would help them
    • The web app is developed for disabled people so that predicting a suitable job for them becomes easier
    • The app would take disabilities and qualifications as input and predicts suitable job title using the inputs
    • Also, the KMeans algorithm is used to recommend similar jobs to them


    Timeline

    Computer Vision Engineer

    Aftershoot
    04.2022 - Current

    Machine learning and DevOps Internship

    Tekolutions.ai
    01.2022 - 04.2022

    Bachelors of Engineering - Information Technology

    Mahatma Gandhi Mission's College of Engineering And Technology
    08.2018 - 05.2022
    Harish Natarajan