HARDEEP SINGH

Bengaluru

Summary

Results-driven DevOps Engineer with 9+ years of experience architecting, automating, and optimizing large-scale cloud infrastructures across AWS, Azure, and GCP. Skilled in designing secure, high-availability environments with Kubernetes, Terraform, and CI/CD pipelines, while driving continuous delivery, performance, and operational excellence across enterprise platforms.

Combining deep DevOps expertise with emerging AI-driven practices, I bring hands-on experience in MLOps, LLMOps, and AIOps—from deploying and scaling ML pipelines to managing LLM workloads on Kubernetes, integrating observability, and implementing intelligent alerting and anomaly detection. Known for bridging data science and infrastructure teams, I enable faster experimentation, reliable model delivery, and reduced downtime through automation and predictive monitoring.

Recognized for a proactive, innovation-focused approach, I help organizations achieve scalable, intelligent, and self-healing infrastructure systems, ensuring agility, reliability, and measurable business impact.

Overview

10 years of professional experience
5 certifications

Work History

Senior DevOps Engineer

BlueAlly
11.2023 - Current
  • AI-Driven DevOps Modernization: Designed and implemented intelligent CI/CD pipelines that leverage machine learning models to predict deployment risks, reducing rollback incidents by 35% and improving release reliability.
  • MLOps Pipeline Automation: Architected a scalable MLOps framework using Kubeflow, MLflow, and Argo Workflows for Genentech’s internal AI projects, enabling seamless model training, testing, and deployment across hybrid environments.
  • LLMOps Infrastructure Enablement: Built and optimized GPU-powered Kubernetes clusters for Large Language Model (LLM) training and fine-tuning workflows, cutting model inference latency by 45% while ensuring high availability.
  • AIOps-Based Monitoring: Implemented predictive alerting using AIOps principles with Prometheus, Grafana, and anomaly detection models, resulting in a 40% drop in false alerts and faster root cause analysis.
  • Cloud & Automation Excellence: Managed hybrid infrastructure across AWS, Azure, and GCP, applying Infrastructure-as-Code (Terraform, Helm, ArgoCD) for rapid environment provisioning and zero-downtime upgrades.
  • Cross-Domain Collaboration: Partnered with ML engineers and data scientists to align DevOps and MLOps workflows—accelerating model delivery cycles and enhancing observability across the ML lifecycle.
  • Cost Optimization via AI Insights: Integrated AI-based utilization forecasting for cloud workloads, saving ~22% annually in compute and storage costs.
  • Security & Compliance Automation: Automated secret rotation, access policies, and vulnerability scanning across AI/ML pipelines, ensuring compliance with Roche’s internal security standards.
  • Authored Terraform configurations as part of the Infrastructure-as-Code (IaC) strategy, reducing infrastructure provisioning time by 30%.
  • Reduced deployment costs by $500K annually through optimized resource allocation.
  • Managed 100+ cloud environments in multi-region setups.

DevOps Consultant

Strata Consulting, LLC
08.2023 - 01.2024
  • Reduced cloud infrastructure costs by 15% annually, resulting in savings of $200,000.
  • Improved application deployment time by 30%, resulting in faster time-to-market for critical projects.
  • Managed deployment pipelines for 50+ applications across multiple environments, with zero downtime.
  • Collaborated on a cloud migration project that included microservices architecture, resulting in a 20% increase in system resilience.
  • Implemented Helm for Kubernetes management, which streamlined deployments and reduced downtime during updates.
  • Managed a large-scale Git migration that improved version control practices for over 200 development staff.
  • Introduced DevSecOps practices, including static code analysis and IaC scanning, strengthening security posture across the SDLC.
  • Automated build and release cycles using Azure DevOps, enhancing productivity by 15% across the development teams.

DevOps Lead

AgiliTech Software Services Private Limited
06.2016 - 07.2023
  • Increased system reliability by 30% by implementing advanced monitoring solutions across cloud infrastructure.
  • Developed and maintained scalable CI/CD pipelines, which improved software iteration speed by 30%.
  • Modernized legacy systems to a containerized approach using Docker, boosting deployment efficiency.
  • Played a pivotal role in establishing a centralized logging solution with ELK Stack, enhancing system observability.
  • Carried out performance tuning of .NET applications, which yielded a 10% improvement in transaction processing.

DevOps Engineer

Ola (ANI Technologies Pvt. Ltd)
04.2017 - 05.2018
  • Automated CI build and deployment processes for diverse projects.
  • Collaborated with Architecture, Development, Test, Security, and IT teams.
  • Developed solutions for app deployment, monitoring, and testing.
  • Migrated on-premises infrastructure to AWS cloud, ensuring seamless data transition.
  • Deployed and managed AWS environment (EC2, ELB, S3, VPC, RDS).

Assistant Engineer

Hindustan Dorr Oliver Limited
05.2015 - 04.2017
  • Installed and supported Cisco/IBM/Dell servers and storage solutions.
  • Migrated on-premises infrastructure to AWS cloud, ensuring seamless data transition.
  • Deployed and managed AWS environment (EC2, ELB, S3, VPC, RDS).
  • Provided 24/7 system issue resolution, ensuring high availability.

Education

BE - Mechanical Engineering

Annamalai University
08.2014

Skills

  • AWS
  • Azure
  • GCP
  • Hybrid & Multi-Cloud
  • Terraform
  • Pulumi
  • AWS CDK
  • Docker
  • Kubernetes
  • Helm
  • Service Mesh
  • Istio
  • Linkerd
  • GitHub Actions
  • ArgoCD
  • Jenkins
  • GitLab CI
  • IAM
  • Zero Trust
  • OPA
  • HashiCorp Vault
  • SAST/DAST
  • Prometheus
  • Grafana
  • OpenTelemetry
  • ELK Stack
  • Kubeflow
  • MLflow
  • AWS DevOps Guru
  • Python
  • Go
  • Bash
  • GitOps
  • MLOps
  • Containerization technologies
  • Scripting languages
  • Database administration
  • Incident management
  • LLMOps
  • AIOps

Accomplishments

  • AI Infrastructure Modernization: Designed and deployed an end-to-end MLOps pipeline using Kubernetes, MLflow, and Airflow—reducing model deployment time by 40% and improving reproducibility across environments.
  • LLM Deployment Automation: Led the implementation of LLMOps workflows for fine-tuning and serving large language models (LLMs) on GPU-optimized Kubernetes clusters, achieving 50% faster inference performance and cost-efficient scaling.
  • Predictive Monitoring with AIOps: Built intelligent monitoring using Prometheus, Grafana, and anomaly detection with AI models, proactively preventing downtime and reducing alert noise by 35%.
  • Data-Driven CI/CD Optimization: Integrated AI-based regression analysis into CI/CD pipelines to auto-detect deployment anomalies, improving release stability by 30%.
  • Cloud Cost Optimization via AI Insights: Implemented an AI-powered cost analysis system that identified underutilized resources and cut annual cloud spending by 20%.
  • Cloud Migration Lead: Led a successful cloud migration for a critical online platform, resulting in a 20% increase in performance and resilience.
  • Optimized Deployment Efficiency: Reduced deployment time by 30% through CI/CD pipeline automation.
  • Efficiency Award: Received an Efficiency Award for automating software delivery processes, reducing manual software build time by 60%.
  • Cost Reduction Champion: Formulated a cost-saving strategy by optimizing cloud resources, cutting infrastructure expenses by 25% yearly.
  • Security Implementation Lead: Directed a comprehensive security overhaul which decreased vulnerability occurrences by 35% over the course of one year.

Certification

  • AWS Certified Solutions Architect – Professional, Amazon Web Services (AWS)
  • AWS Certified DevOps Engineer – Professional, Amazon Web Services (AWS)
  • AWS Certified Solutions Architect – Associate, Amazon Web Services (AWS)
  • Microsoft Certified: Azure Administrator Associate
  • Microsoft Certified: DevOps Engineer Expert

Languages

English - Native
Hindi - Native
