Selvakumar Sathapillai

ML/AI DevOps Engineer
Bengaluru

Summary

A dedicated ML/AI DevOps Engineer with over 12 years of experience in software development and systems integration, specializing in AI/ML pipelines, model deployment, and CI/CD automation. Experienced with cloud platforms (AWS, GCP), Docker, Kubernetes, and Spring.

Proficient in working with databases (MySQL, Oracle, MongoDB, DB2) and web services for integrating ML models into production systems. Extensive experience with WebLogic, WebSphere, Apache Tomcat, Spring Boot, and AWS services including S3, SQS, and DynamoDB.

Experienced in leading teams through the complete lifecycle of ML/AI projects, from requirement gathering to deployment. Passionate about continuous learning, with a focus on delivering innovative solutions and optimizing workflows in ML/AI-driven environments. Adept at working in fast-paced, dynamic teams while ensuring high-quality, timely project delivery.

Overview

13 years of professional experience

Work History

DevOps Engineer – Machine Learning

Blume Global
Bangalore
05.2020 - Current
  • CI/CD Pipeline Implementation: Designed and automated CI/CD pipelines using Jenkins for the deployment of machine learning models, ensuring efficient integration and deployment across multiple environments.
  • Containerization & Orchestration: Utilized Docker to containerize ML models and applications, deploying them on Kubernetes for scalable and resilient production environments.
  • Model Serving: Developed and deployed Flask-based microservices for model inference, optimizing performance with Gunicorn to serve high-traffic production requests.
  • Data Pipeline Automation: Integrated Apache Airflow to automate and schedule complex data workflows, including data preprocessing, feature engineering, and model retraining. Managed dependencies and DAGs to ensure smooth data pipeline execution.
  • Messaging & Event Streaming: Integrated Apache Kafka to handle event-driven architectures and manage real-time data streams for model inference and prediction in production environments.
  • Model Deployment & Monitoring: Implemented model deployment pipelines using Flask APIs and Gunicorn, enabling seamless model updates and ensuring robust API performance under load.
  • Data Management: Leveraged MongoDB for scalable, NoSQL data storage solutions and Apache Hive for large-scale data warehousing, enabling efficient data retrieval for model training.
  • Cloud Infrastructure Management: Deployed and managed machine learning models on GCP (Google Cloud Platform), utilizing services like AI Platform, Cloud Functions, and BigQuery for model training, data processing, and storage.
  • Model Monitoring & Logging: Utilized Prometheus and Grafana to track performance metrics (latency, accuracy, resource utilization) of deployed models, ensuring proactive monitoring and timely issue resolution.
  • Model Retraining Automation: Set up automated model retraining pipelines that consumed live data from Kafka streams and historical data stored in MongoDB and Hive, so that models are retrained continuously as new data arrives.

Technical Lead Engineer

Manhattan Associates
Bangalore
01.2018 - 04.2020
  • Worked on the Warehouse Management Open Systems and Slotting Optimization project, quickly becoming the Critical Point of Contact for Work Release, Location, and Layout components, mentoring on-shore and off-shore teams to ensure successful task completion and meeting objectives.
  • Led code reviews, design reviews, and ensured process adherence for source code control, unit testing, and builds, while delivering key functionalities such as Scoring, Bulk Support, Pick Cart Planning, and Pack Wave in the Work Release and Slotting modules.
  • Suggested and implemented process improvements for estimating, development, and testing, along with algorithms to optimize processes and improve efficiency across components.
  • Reduced open tickets by categorizing, prioritizing, and analyzing issues, and successfully implemented key functionalities like Zone Aware JDM, Street Position, and Level Elevation Calculation for Layout.
  • Provided Go-Live support for Work Release and DCLayout, and contributed to Item Grouping, Balancing, and Slotting functionalities, ensuring smooth deployment and functionality.

Senior Software Engineer

Infrrd Private
Bangalore
01.2015 - 12.2017
  • Worked on the Chrome River project, an intuitive expense management system that automates expense and invoice processes using new technologies, simplifying business tasks with minimal training.
  • Developed various functional components as part of the Integration Team, including implementing direct pay remit files for BOFA, BMO, HSBC, JPMC, and BARC.
  • Integrated the Enterprise Travel Data System with Chrome River, improving overall data flow and system efficiency.
  • Provided production support by analyzing and resolving issues, preparing patches, and delivering them to the customer team; also worked on PGP encryption, decryption, and digital signing.
  • Gained 7 months of onsite experience, managing the team, gathering requirements, and interacting with clients to ensure project success.

Software Engineer

NTT DATA FA Insurance Systems Private
Bangalore
07.2012 - 01.2015
  • Worked on FirstGen Neo, a web-based policy administration system that manages the entire insurance process from sale to the general ledger, supporting various types of insurance such as Motor, Fire, Marine, Liability, and PA. The system is multi-currency, multi-language, and highly flexible and scalable, allowing easy configuration and quick updates to business rules without programming changes.
  • Developed Web Service functionalities for Underwriting, Claims, Accounts, and Party modules, and enhanced the system with new features like Renewal Deselection, 360 Enquiry, and Risk Accumulation Enquiry.
  • Assisted in requirement analysis, CUT, and batch upload functionality improvements, including issue fixing and modifications.
  • Created server architecture for the production environment using WebLogic and Apache Tomcat, and managed Build & Delivery on development and test environments.
  • Provided production support, analyzing and resolving issues, preparing patches, and delivering them to the customer team.

Education

Bachelor of Engineering - Computer Science

Government College of Technology
Coimbatore
04.2012

Skills

Java, Python, TypeScript, Node.js

HTML, CSS, JS, Angular, React

Spring, Webworks, ORM

MongoDB, MySQL, PostgreSQL, Oracle, DynamoDB

GCP, AWS, Docker, Kubernetes

Flask, Gunicorn, Terraform, Ansible

AutoML, Kubeflow, OAuth, JWT, CAS

Prometheus, Grafana, ELK Stack

Apache Kafka, Apache Airflow, Apache Spark

Continuous Integration, Continuous Deployment, Automated Testing, Version Control

Accomplishments

  • Bright Star Award, NTT DATA FA Insurance Systems India Pvt. Ltd.
  • Best Performer Award, Raremile Technologies
  • Star Developer Award, Manhattan Associates
  • Medallion Award, Blume Global
