Shikhar Gupta

Bangalore

Summary

Experienced and results-driven Senior DevOps Engineer with a strong background in Kubernetes, observability, cloud infrastructure, and automation. Proven track record of improving system reliability, reducing operational toil, and enabling faster deployments through efficient tooling and workflows. Adept at managing the end-to-end lifecycle of clusters, enhancing monitoring systems with Grafana and Prometheus, and collaborating cross-functionally to support multi-tenant environments. Passionate about building scalable DevOps solutions and mentoring others through hands-on technical guidance, coding best practices, and automation.

Overview

7 years of professional experience
1 Certification

Work History

Senior DevOps Engineer

NVIDIA
06.2021 - Current
  • Automated Cluster Validation: Designed and implemented an automation workflow for cluster validation on BCM (Bare-Metal Cloud) in OCI, cutting down the manual validation cycle from multiple days to half a day, significantly accelerating customer handoff timelines.
  • Improved Observability Stack for Kubernetes-aaS: Led initiatives to stabilize and enhance the observability stack for Kubernetes-as-a-Service, improving its resilience, fault tolerance, and reliability across multi-tenant environments.
  • Monitoring Stack Evaluation & Strategy: Conducted in-depth evaluation of Grafana Cloud, Panoptes, and other observability tools, influencing key architectural decisions around monitoring, alerting, and incident response.
  • On-Call Toil Reduction: Reduced operational toil by improving on-call workflows, enhancing tooling, and taking ownership of incident management processes, resulting in smoother and faster recovery during outages.
  • Multi-Tenant Cluster Build Support: Provided end-to-end support for Kubernetes cluster provisioning and management for multiple tenants, ensuring seamless environment setup and minimizing onboarding friction.
  • Grafana Dashboards & Tooling Development: Built custom Grafana dashboards and monitoring tools tailored to team and tenant-specific use cases, improving visibility and alerting coverage.
  • AI Agent for SRE Efficiency: Created an AI-based agent bot that assists SREs by analyzing alerts and offering contextual insights, reducing mean time to detection (MTTD) and improving response efficiency during incidents.

Mentor

Newton School
01.2021 - 05.2021
  • Mentored College Students in Programming: Taught core programming concepts and industry best practices to third- and fourth-year engineering students as part of a freelance mentoring engagement with Newton School.
  • Conducted Hands-On Sessions in Java & Git: Delivered practical sessions covering Java fundamentals, version control using Git, and real-world software development workflows to prepare students for industry roles.
  • Focused on Foundational & Applied Learning: Helped students build a strong foundation in coding logic, object-oriented programming, and collaborative development practices aligned with current tech standards.

Software Engineer

CGI
08.2019 - 05.2021
  • Developed “Quizzo” Web Application: Created an auto-generating quiz engine using Angular 8 and Java Spring Boot, which dynamically generates quizzes from Wikipedia pages. This was built during the training program at NIIT StackRoute, Koramangala.
  • Implemented Full-Stack Architecture: Designed and developed a RESTful backend server to manage and store user and quiz data in an online database, integrating both frontend and backend components seamlessly.
  • Explored Modern Technologies: Gained hands-on experience with Spring Boot, Angular 8, and Microservices architecture, building scalable and modular components for the Quizzo application.
  • End-to-End Project Ownership: Worked on the development and maintenance of a mutual fund web application for a client, handling complete frontend and backend responsibilities including UI design, API development, and database integration.

Trader

Futures First
05.2018 - 08.2018
  • Applied Technical Indicators for Market Analysis: Utilized various technical indicators such as Moving Averages, Relative Strength Index (RSI), and trend analysis techniques to study and predict market behavior and momentum.
  • Developed ML Model for Price Prediction: Implemented a machine learning algorithm to forecast the future price trends of a cotton derivative, leveraging historical data and features extracted from technical indicators.

Education

Bachelor of Science - Chemical Engineering

IIT BHU (Varanasi)
05-2019

No Degree - Science And Computer

City Montessori School Lucknow
Lucknow
03-2014

Skills

  • Spring Boot
  • AngularJS
  • React.js
  • Spring MVC
  • Machine learning, NLP
  • Microservices
  • Java, C
  • Core Java
  • MySQL
  • MongoDB
  • Docker
  • Jenkins
  • Jira
  • Agile development
  • Kubernetes
  • Argo CD
  • Prompt engineering

Certification

CKA – Certified Kubernetes Administrator

  • Offered by: The Linux Foundation & CNCF
  • Focus: Cluster administration, networking, storage, security, troubleshooting, etc.
  • Ideal for: DevOps engineers, SREs, system administrators

Languages

English
Full Professional
Hindi
Native or Bilingual
