RAJ KUMAR SINGH

Sr. AI Software Engineer
Bangalore

Summary

Senior AI & DevOps Engineer with 15+ years of experience in DevOps, cloud automation, and AI/ML model deployment. Skilled in EKS cluster management, Kubernetes orchestration, and multi-cloud AI inference workloads (AWS, Azure, and moderate experience with GKE). Experienced in CI/CD automation (Jenkins, GitLab CI, Azure DevOps), Infrastructure as Code (Terraform, Ansible), and SRE best practices that improve scalability, reliability, and security. Proficient in monitoring and logging (Prometheus, ELK, CloudWatch) and in optimizing cloud infrastructure performance. Delivered projects in fintech, healthcare, and AI, including workload migrations to Kubernetes, serverless microservices, and cloud security solutions. Passionate about mentoring teams, automating infrastructure, and driving DevOps adoption.

Overview

15 years of professional experience
8 certificates
2 languages

Work History

Sr. Software Engineer - AI (Prev. Sr. Software Engineer)

Mphasis Ltd.
03.2021 - Current
  • AI & Cloud Infrastructure: Supporting and optimizing generative AI-powered applications in a globally distributed environment. Managing AI inference workloads across multiple cloud providers, leveraging PyTorch and TensorFlow for model deployment. Responsible for deploying and maintaining EKS clusters, deploying applications for DigitalRisk and other AI services, and integrating AI models to provide scalable solutions. AI models are deployed on both EKS (Elastic Kubernetes Service) and GKE (Google Kubernetes Engine) clusters, ensuring high availability and performance in containerized environments.
  • Service Reliability: Led the design and implementation of DevOps practices to enhance service availability and performance across multi-cloud environments. Collaborated closely with service owners and architecture teams to improve service SLOs, automate incident responses, and minimize downtime.
  • Wealth Management (Fintech) Project: Spearheaded the project as a Sr. AWS DevSecOps Engineer. Designed and implemented AWS serverless microservices using Lambda and Terraform, and automated infrastructure tasks with Python/Bash scripts and third-party tools.
  • VEDaaS Migration: Migrated Verified Entity Data as a Service (VEDaaS) from EC2-Classic to VPC for improved security and scalability, and transitioned the UAT and PROD environments to Kubernetes (EKS) with Blue/Green deployment strategies.
  • Internal Geek Cloud Projects: Led DevOps initiatives for internal projects, including architecture design and implementation, mentoring teams in adopting DevOps culture, and facilitating recruitment processes.
  • AWS Serverless Breakfix CRM Project: Oversaw the end-to-end delivery, including infrastructure management, branching, pipeline, and automation tasks. Deployed a Dockerized React app for Breakfix CRM API monitoring on EC2.
  • Azure DevOps CI/CD for RRB Project: Implemented CI/CD pipelines for the RRB project on Azure DevOps Server, deploying on AWS EC2 instances using CodeDeploy and autoscaling configurations.
  • Monitoring & Automation: Automated OpenSearch and host record comparison for monitoring with Python Lambda functions. Developed Python scripts for GitLab pipeline automation, handling package retrieval and notification triggers.
  • Infrastructure Automation: Developed an ECS Fargate cluster triggered by Lambda to generate reports and handle long-running tasks. Automated infrastructure deployments using Terraform, Python, and Bash scripting.
  • DevOps Architecture: Validated Amazon MWAA, Airbyte on EKS, and Data Lake solutions for CSV data processing, following GMP scope requirements.
  • Containerization & Orchestration: Deployed containerized FHIR servers on EC2 using Jenkins, and on ECS clusters with CodePipeline across multiple accounts. Created and managed Tekton pipelines on an OpenShift Kubernetes cluster on GCP, deploying applications to GCP Cloud Run.
  • Monitoring & Observability: Configured and optimized monitoring solutions using Prometheus and ELK stacks for critical, high-performance services.
  • Enhanced software functionality by identifying and resolving complex technical issues.
  • Mentored junior developers, fostering professional growth and enhancing team productivity.
  • Developed scalable applications using agile methodologies for timely project delivery.
  • Managed multiple projects simultaneously while maintaining strict deadlines and high-quality standards.

Sr. Software Engineer

Locuz
01.2019 - 03.2021
  • Managed DevOps & HPC development teams, driving the adoption of DevOps practices.
  • Led multiple cloud and IT automation projects, including migrations and security assessments.
  • Migrated Joomla to WordPress and MySQL to MySQL RDS on AWS for Project Management Institute using Terraform.
  • Transitioned a healthcare portal to Lightsail (dev instance) using Terraform.
  • Moved an amusement park portal to the AWS cloud (EC2 autoscaling) with Terraform and Jenkins, migrating Java, .NET Core, and Oracle Forms projects.
  • Updated HPC suite (Ganana) to run on RHEL 8 for high-performance computing.
  • Developed an AWS cloud security assessment portal with Python and Django that scans AWS resources against the CIS Level 2 benchmark and generates remediation reports.
  • Maintained comprehensive documentation of development work, facilitating knowledge sharing among team members.
  • Regularly reviewed peers' code contributions, offering constructive feedback to enhance overall product quality.
  • Collaborated with cross-functional teams to design innovative software solutions.
  • Proactively identified areas for process improvement, implementing changes that led to significant time savings for the team.

Operation Associate

Tata Hitachi
09.2010 - 01.2019
  • Collaborated with cross-functional teams to design and deploy systems, implementing improvements for enhanced efficiency.
  • Led the migration of internal projects to the cloud and managed IT administration tasks.
  • Oversaw daily operations, ensuring timely completion of tasks and adherence to company policies and procedures.
  • Improved operational efficiency by implementing new processes and streamlining existing workflows.
  • Maintained and prioritized a to-do list and followed up to complete tasks on time.
  • Resolved customer issues promptly and professionally, maintaining high satisfaction rates while minimizing escalations.

Education

Master of Computer Applications

IGNOU
New Delhi
06.2010

Bachelor of Computer Applications

IGNOU
New Delhi
06.2008

Skills

Cloud & Infrastructure

Certification

AWS Certified Solutions Architect - Associate

Interests

Sports (football, badminton), guitar, travelling, programming, and following current affairs, science and tech, and farming blogs

Accomplishments

Accolade Appreciation: Received the Pinnacle Award for leading the team that architected, built, and deployed the EKS clusters and applications for the DigitalRisk project.
